Engineering Data Mesh in Azure Cloud
Implementing Azure Data Factory and Databricks in large-scale data migration projects, such as the Netezza retirement plan at QVC, presents the challenge of managing vast volumes of data transferred from legacy systems to modern cloud environments; effective validation and reprocessing are required to handle duplicates and maintain data integrity. Opportunities include leveraging Azure's advanced analytics capabilities for enhanced reporting and systematic analysis, and handling complex workloads through PySpark scripts for data acquisition and transformation, leading to more efficient data processing.
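As a rough illustration of that reprocessing step, the following PySpark sketch deduplicates migrated records on a business key before landing them in a curated zone. The lake paths, the `order_id` key, and the `load_ts` ordering column are hypothetical placeholders, not details from the actual QVC pipeline.

```python
from pyspark.sql import SparkSession, Window
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("netezza-migration-dedup").getOrCreate()

# Hypothetical raw zone: records exported from the legacy Netezza system.
raw = spark.read.parquet("abfss://raw@examplelake.dfs.core.windows.net/orders/")

# Keep only the most recent record per business key (order_id is an assumed column).
w = Window.partitionBy("order_id").orderBy(F.col("load_ts").desc())
deduped = (
    raw.withColumn("rn", F.row_number().over(w))
       .filter(F.col("rn") == 1)
       .drop("rn")
)

# Land the validated, duplicate-free data in the curated zone.
deduped.write.mode("overwrite").parquet(
    "abfss://curated@examplelake.dfs.core.windows.net/orders/"
)
```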
PySpark complements Azure SQL by providing a powerful framework for data acquisition and transformation, enabling large-scale data processing through the Spark SQL API for efficient computation and analysis. Azure SQL, in turn, serves as a robust database for storing and managing structured data, facilitating seamless integration and efficient querying in enterprise environments. Used together, they maximize data processing efficiency, supporting the complex transformations and queries that drive business insights and decision-making.
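A minimal sketch of this pairing, assuming a generic Azure SQL database reachable over JDBC; the server, database, table, column names, and credentials below are placeholders:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("azure-sql-read").getOrCreate()

# Pull a structured table from Azure SQL into a Spark DataFrame over JDBC.
# Connection details are hypothetical; in practice they come from a secret scope.
sales = (
    spark.read.format("jdbc")
    .option("url", "jdbc:sqlserver://example-server.database.windows.net:1433;database=exampledb")
    .option("dbtable", "dbo.sales")
    .option("user", "etl_user")
    .option("password", "<secret>")
    .option("driver", "com.microsoft.sqlserver.jdbc.SQLServerDriver")
    .load()
)

# The heavy transformation work runs in Spark, not in the source database.
monthly = sales.groupBy("region", "month").sum("amount")
monthly.show()
```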
Verification and validation of raw data are crucial in data migration projects such as the Netezza retirement plan to ensure accuracy, consistency, and completeness as data moves from legacy systems to new environments. This process helps identify and rectify errors or anomalies early, such as duplicates, confirming that data meets business requirements and preserving its integrity during the transition. It also minimizes the risk of data corruption or loss, which could otherwise impair operational efficiency and strategic decision-making.
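One simple form such checks can take is comparing row counts and key uniqueness between the source and target extracts. This is a hedged sketch only; the paths and the `customer_id` key column are assumptions:

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("migration-validation").getOrCreate()

source = spark.read.parquet("abfss://raw@examplelake.dfs.core.windows.net/customers/")
target = spark.read.parquet("abfss://curated@examplelake.dfs.core.windows.net/customers/")

# Completeness: every source row should be accounted for in the target.
src_count, tgt_count = source.count(), target.count()
assert tgt_count <= src_count, "target has more rows than source"

# Consistency: the business key (customer_id is assumed) must be unique after dedup.
dupes = target.groupBy("customer_id").count().filter(F.col("count") > 1)
assert dupes.count() == 0, "duplicate keys survived reprocessing"

# Accuracy spot-check: no unexpected nulls in a mandatory column.
null_keys = target.filter(F.col("customer_id").isNull()).count()
print(f"source={src_count}, target={tgt_count}, null keys={null_keys}")
```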
Strategies to handle incidents and tasks effectively in Azure-based projects include implementing automated monitoring and alerting to identify and respond to issues quickly, and employing robust logging and diagnostic tools for thorough incident analysis and resolution. Regular training for team members on Azure best practices, combined with cross-functional coordination, supports swift and proactive responses. Incident management frameworks such as ITIL, tailored to Azure environments, can streamline response processes. Additionally, maintaining detailed documentation and conducting post-incident reviews helps teams understand root causes and prevent recurrence.
Data formats such as Avro, JSON, and Parquet are significant in Azure Data Lakes because each offers distinct advantages for storing and processing data. Avro is well suited to serializing records and supports schema evolution, which is useful in dynamic environments. JSON is widely used for its simplicity and flexibility in representing hierarchical data structures. Parquet is a columnar format that optimizes storage and query performance, especially for complex analytical workloads, making it well suited to big data processing in Azure environments. Together, these formats enable efficient data storage and access, facilitating advanced analytics and transformation processes.
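The trade-off is easy to see by writing the same DataFrame in all three formats. The lake path and sample schema below are illustrative, and the Avro writer assumes the spark-avro package is available (it ships with Databricks runtimes):

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("format-comparison").getOrCreate()

df = spark.createDataFrame(
    [(1, "widget", 9.99), (2, "gadget", 14.50)],
    ["id", "name", "price"],
)

base = "abfss://demo@examplelake.dfs.core.windows.net/products"

# Row-oriented Avro: compact serialization with schema-evolution support.
df.write.mode("overwrite").format("avro").save(f"{base}/avro/")

# JSON: human-readable and flexible for hierarchical or loosely structured data.
df.write.mode("overwrite").json(f"{base}/json/")

# Columnar Parquet: best for analytical scans over a subset of columns.
df.write.mode("overwrite").parquet(f"{base}/parquet/")
```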
Gireesh K's professional experience and skills align with the demands of Azure and big data projects through his comprehensive understanding of the big data ecosystem and Azure Cloud systems, both crucial for managing and implementing modern data solutions. His practical experience with PySpark, Spark SQL, Azure Data Factory, and Databricks, along with his familiarity with transforming and handling large datasets, positions him well for executing complex data projects. His strong analytical skills and ability to configure workflows and manage incidents further suit him to dynamic Azure environments.
The Spark SQL API plays a critical role in processing large datasets in Databricks by providing a scalable, efficient interface for executing SQL queries over large distributed datasets. It lets developers leverage Spark's distributed computation engine to perform complex transformations and aggregations on data stored in various formats. Its integration with DataFrames allows seamless interoperability with structured data in Azure Data Lake, enabling faster data processing and analysis and improving performance and efficiency in data-intensive applications.
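A brief sketch of that DataFrame-to-SQL interoperability, with an assumed Azure Data Lake path and illustrative column names:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("spark-sql-demo").getOrCreate()

# Load structured data from Azure Data Lake as a DataFrame (path is illustrative).
events = spark.read.parquet("abfss://curated@examplelake.dfs.core.windows.net/events/")

# Expose the DataFrame to the SQL engine as a temporary view.
events.createOrReplaceTempView("events")

# Run a distributed aggregation with plain SQL; the result is again a DataFrame.
daily = spark.sql("""
    SELECT event_date, event_type, COUNT(*) AS cnt
    FROM events
    GROUP BY event_date, event_type
    ORDER BY event_date
""")

daily.show()
```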
Managing a data product dashboard project with Azure technologies involves several key responsibilities: daily communication with business owners and onsite teams to ensure alignment and progress, designing and developing PySpark code and scripts for data acquisition and transformation, and moving metadata and file data into Azure Data Lake for processing. It also includes creating data-driven workflows in Azure Data Factory for efficient data movement and transformation, and using formats such as Avro, JSON, and Parquet for storage in Azure Data Lakes, as sketched below.
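A sketch of the acquisition-and-landing step in PySpark; the container names, feed file, and lineage columns are assumptions rather than details of any specific project:

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("dashboard-ingest").getOrCreate()

# Acquire raw file data (an illustrative CSV drop from the source system).
raw = (
    spark.read.option("header", "true")
    .csv("abfss://landing@examplelake.dfs.core.windows.net/feeds/products.csv")
)

# Light transformation plus lineage metadata before landing in the lake.
enriched = (
    raw.withColumn("ingest_ts", F.current_timestamp())
       .withColumn("source_file", F.input_file_name())
)

# Persist as Parquet in the processing zone, ready for the dashboard pipeline.
enriched.write.mode("append").parquet(
    "abfss://processing@examplelake.dfs.core.windows.net/dashboard/products/"
)
```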
Leveraging Azure's integration with Power BI can significantly enhance decision-making in business environments by providing robust tools for data visualization and analytics. Azure can ingest and process data from various sources into centralized stores such as Azure SQL and Data Lake, enabling consolidated insights across the enterprise. Power BI builds on this processed data to create interactive dashboards and reports, offering the real-time analytics essential for strategic planning and operational efficiency. This integration ensures that stakeholders have access to actionable insights and can make informed decisions based on the latest data.
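One common pattern, sketched below with placeholder connection details, is to land aggregated results in an Azure SQL table that a Power BI dataset then uses as its source:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("powerbi-feed").getOrCreate()

# Consolidated metrics computed earlier in the pipeline (path is illustrative).
metrics = spark.read.parquet(
    "abfss://curated@examplelake.dfs.core.windows.net/kpi/daily_sales/"
)

# Write to an Azure SQL table that Power BI reports point at.
# Credentials are placeholders; real jobs would pull them from a secret store.
(
    metrics.write.format("jdbc")
    .option("url", "jdbc:sqlserver://example-server.database.windows.net:1433;database=reporting")
    .option("dbtable", "dbo.daily_sales_kpi")
    .option("user", "report_writer")
    .option("password", "<secret>")
    .option("driver", "com.microsoft.sqlserver.jdbc.SQLServerDriver")
    .mode("overwrite")
    .save()
)
```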
The benefits of using Azure Data Lake Storage (ADLS) for storing and extracting large files include its ability to handle massive volumes of varied data in a scalable, secure, and cost-effective manner, providing a single repository for structured, semi-structured, and unstructured data. ADLS offers easy access to data through various Azure tools and supports high-performance analytics workloads. Challenges include ensuring data governance and compliance, managing access control for secure data sharing, and the potential complexity of integrating with existing data management systems. Proper planning and management are essential to address these challenges effectively.
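Access control is typically the first hurdle in practice. A minimal sketch, assuming account-key authentication with placeholder values (production setups usually prefer a service principal or managed identity):

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("adls-access").getOrCreate()

# Authenticate to the storage account with an account key (placeholder values).
# A service principal or managed identity is the safer choice in production.
spark.conf.set(
    "fs.azure.account.key.examplelake.dfs.core.windows.net",
    "<storage-account-key>",
)

# With access configured, large files in any zone can be read directly.
logs = spark.read.json("abfss://raw@examplelake.dfs.core.windows.net/logs/")
print(logs.count())
```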