Spotfire Skills for Data Analysts

Chaitanya Kaul

Associate Analyst - Network Operations


Email: xyz@[Link] | Phone: 999999999 | LinkedIn | GitHub

SUMMARY
Associate Analyst with over 6 months of experience analyzing data with SQL, Python,
Tableau/Spotfire, and Excel. Proficient in statistics, mathematics, and other analytics tools and
technologies.

EDUCATION
M.Tech. in Data Science
Amity School of Engineering and Technology (ASET), Amity University, Gurugram
Cumulative GPA: 9.26/10 | July 2019 – May 2021
B.E. in Information Technology
University Institute of Engineering and Technology (UIET), Panjab University, Chandigarh
Cumulative GPA: 6.91/10 | July 2015 – May 2019

SKILLS
Programming: Python, R, SQL, MySQL, Hive, TensorFlow
BI Tools: Tableau, Power BI, MS-Excel
Relevant Courses: Machine Learning, Natural Language Processing, Probability and Statistics, Data
Analytics and Data Mining, Data Structures, Database Management System, Big Data Technologies

WORK EXPERIENCE
United Airlines Business Services Pvt. Ltd. – Gurugram, HR
Associate Analyst | Apr 2021 – Present
• Worked on the “Miss Connect Rates” project, which analyzed missed connections at different
stations to find ways to keep passengers from missing their connecting flights.
Reduced missed connections by 2%.
• Extracted data by running SQL queries in Teradata SQL Assistant and Microsoft SQL Server.
• Analyzed the data pulled from databases and created reports in MS-Excel.
• Created visualizations using Tableau/Spotfire.
• Automated the reports using Python scripting.

Exposys Data Labs – Bengaluru, KA
Data Science Intern | Sep 2020 – Oct 2020
• Worked on the “Customer Segmentation” project. Visualized customers' gender and age distributions,
then analyzed their income and spending scores. Used K-means, hierarchical, and DBSCAN
clustering.

ACADEMIC PROJECTS
Air Quality Index Prediction: Regression problem. Collected the data through web
scraping. Performed exploratory data analysis, feature engineering, feature selection, and
hyperparameter tuning. Compared the results of various machine learning models and an ANN.
Best RMSE, given by the Random Forest Regressor: 38.85. Built the web application with Flask and
deployed the model on Heroku.
Cotton Plant Disease Prediction: Deep learning classification problem where the aim was to
classify whether a cotton plant is diseased. Used the VGG19 transfer learning model for
modelling and Flask for the web application. Achieved 94.6% accuracy.
Apple Stock Price Prediction and Forecasting: Pulled the data from the Tiingo API. Built a stacked
LSTM RNN model to forecast the company's stock price for the next 30 days from a time step of the
previous 100 days. Test RMSE: 239.6.
Fraud Transaction Classification: Classified whether a particular transaction is fraudulent.
Performed feature engineering, missing-data handling, imbalanced-data handling, correlation
checks, feature selection, and hyperparameter optimization. Ran cross-validation and compared the
results of different classification models. Best accuracy, given by the Random Forest
classifier: 94%.

CERTIFICATIONS
• Data Analysis with Python (IBM, Coursera)
• SQL for Data Science (IBM, Coursera)
• Neural Networks & Deep Learning ([Link], Coursera)
• Python for Data Science (IBM, Coursera)
• Fundamentals of Visualization with Tableau (UC Davis, Coursera)
• Microsoft Excel from Beginner to Advanced (Udemy)
• Machine Learning A–Z (Udemy)

Common questions

A high accuracy score in fraud transaction classification matters because it indicates that the model identifies fraudulent activity correctly, which is crucial for financial security. The project reached it through feature engineering, handling of imbalanced data, hyperparameter optimization, and cross-validation.

The combination of an M.Tech. in Data Science and a B.E. in Information Technology supports proficiency in data science tools and methods. Certifications in 'Data Analysis with Python', 'SQL for Data Science', 'Neural Networks & Deep Learning', and 'Python for Data Science' add further specialized skills.

Academic projects such as 'Air Quality Index Prediction' and 'Apple Stock Price Prediction and Forecasting' build prediction modeling skills through hands-on experience with regression problems, feature engineering, model comparison, and deployment. Both projects involve applying machine learning techniques and evaluating model performance, here measured with RMSE.
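
The two ideas behind the stock forecasting project, a fixed look-back window and an RMSE score, can be illustrated with a minimal sketch. It is not the project's code: the function names `make_windows` and `rmse`, the made-up price series, and the naive "repeat the last value" baseline are all assumptions for illustration, and the resume's 100-day time step is shortened to 3 so the example stays readable.

```python
import math


def make_windows(series, time_step):
    """Turn a series into (window, next value) pairs for supervised learning."""
    inputs, targets = [], []
    for start in range(len(series) - time_step):
        inputs.append(series[start:start + time_step])
        targets.append(series[start + time_step])
    return inputs, targets


def rmse(actual, predicted):
    """Root mean squared error between two equal-length sequences."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Made-up prices, for illustration only.
prices = [float(day) for day in range(1, 11)]
inputs, targets = make_windows(prices, time_step=3)

# Hypothetical baseline: predict that the next price equals the last one seen.
naive_predictions = [window[-1] for window in inputs]
baseline_rmse = rmse(targets, naive_predictions)
print(len(inputs), baseline_rmse)
```

A trained model such as the stacked LSTM would replace the naive baseline. Scoring both with the same `rmse` function shows whether the model actually improves on the trivial forecast.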

Certifications in Tableau and Excel enhance data visualization skills by providing structured learning on how to create and interpret complex data visualizations. These certifications cover the fundamentals and advanced functionalities of data visualization tools, enabling professionals to effectively communicate data insights and make data-driven decisions.

Machine learning and statistical skills complement each other in projects. Statistical skills help in understanding data distributions and designing experiments, while machine learning provides tools to build predictive models and automate decision-making. Projects like 'Air Quality Index Prediction' and 'Fraud Transaction Classification' drew on both, since they involved data manipulation as well as model evaluation.

The objective of the 'Miss Connect Rates' project was to analyze the reasons behind missed connections at different stations and devise solutions to minimize such occurrences. The outcome was a 2% reduction in missed connections, indicating improved operational efficiency.

Cross-validation plays a critical role in evaluating classification models because it gives a more reliable estimate of a model's accuracy and generalizability than a single train/test split. It trains and scores the model on multiple splits of the data, so every score comes from data the model did not see during training, which helps detect overfitting.
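
The splitting procedure described above can be sketched in plain Python. This is a minimal k-fold illustration, not the fraud project's code: the helper names (`k_fold_indices`, `cross_validate`), the toy data, and the majority-class stand-in model are all hypothetical.

```python
import random


def k_fold_indices(n_samples, k, seed=0):
    """Split range(n_samples) into k shuffled folds of near-equal size."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Fold i takes every k-th index starting at i, so sizes differ by at most 1.
    return [indices[i::k] for i in range(k)]


def cross_validate(features, labels, k, fit, predict, seed=0):
    """Train on k-1 folds, score accuracy on the held-out fold, repeat k times."""
    scores = []
    for held_out in k_fold_indices(len(features), k, seed):
        held = set(held_out)
        train = [i for i in range(len(features)) if i not in held]
        model = fit([features[i] for i in train], [labels[i] for i in train])
        correct = sum(predict(model, features[i]) == labels[i] for i in held_out)
        scores.append(correct / len(held_out))
    return scores


# Hypothetical stand-in model: always predicts the most common training label.
def fit_majority(train_x, train_y):
    return max(set(train_y), key=train_y.count)


def predict_majority(model, x):
    return model


# Toy imbalanced data: 15 samples of class 0 and 5 of class 1.
features = [[value] for value in range(20)]
labels = [0] * 15 + [1] * 5
scores = cross_validate(features, labels, k=5,
                        fit=fit_majority, predict=predict_majority)
print(scores, sum(scores) / len(scores))
```

On this toy data the stand-in model never predicts class 1 yet still averages 75% accuracy, which is why accuracy alone can mislead on imbalanced data such as fraud labels.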

Python scripting was used for report automation and data analysis. Specifically, in the Associate Analyst role at United Airlines Business Services, Python scripts automated the reports, streamlining the data analysis process and improving efficiency.

K-means clustering can facilitate customer segmentation in a business context by grouping customers into clusters based on similar attributes such as age, spending habits, and income level. This allows businesses to tailor marketing strategies and services to different customer segments, improving customer satisfaction and business outcomes.
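
The grouping step can be illustrated with a minimal K-means sketch in plain Python. It is not the internship project's code: the function name `k_means`, the random initialization, and the made-up (income, spending score) pairs are assumptions for illustration.

```python
import math
import random


def k_means(points, k, iterations=100, seed=0):
    """Cluster points (tuples of numbers) into k groups; return centroids and labels."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k distinct data points
    assignments = None
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        new_assignments = [
            min(range(k), key=lambda c: math.dist(point, centroids[c]))
            for point in points
        ]
        if new_assignments == assignments:
            break  # no point changed cluster, so the algorithm has converged
        assignments = new_assignments
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:
                centroids[c] = tuple(sum(dim) / len(members) for dim in zip(*members))
    return centroids, assignments


# Made-up (annual income, spending score) pairs, for illustration only.
customers = [(15, 80), (16, 78), (18, 82), (85, 20), (88, 15), (90, 22)]
centroids, assignments = k_means(customers, k=2)
print(centroids, assignments)
```

The first three customers (low income, high spending) end up in one cluster and the last three in the other. With real data, features on different scales would usually be standardized first so that no single attribute dominates the distance.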

The 'Cotton Plant Disease Prediction' project used transfer learning with VGG19, a pretrained convolutional neural network architecture, to classify whether cotton plants were diseased. The project achieved 94.6% accuracy, demonstrating an effective application of deep learning to agricultural diagnostics.
