價格方案
Avatar of Shyam Ahuja.
Shyam Ahuja
Senior Data Engineer
個人檔案
職場能力評價
0

貼文
0個聯絡人
列印
Avatar of the user.

Shyam Ahuja

Senior Data Engineer
Azure Architect with 14 years of experience in IT services and consulting, specializing in the design and implementation of data-intensive applications leveraging Azure Cloud and Big Data technologies. Proficient in architecting scalable solutions using Azure services such as Azure Data Lake, Azure Synapse Analytics, and Azure Databricks. Skilled in Spark analytics, AWS, Scala, Kafka, Hive, SQL, and Python, with a strong focus on data orchestration and integration. Proven track record in leading agile scrum teams, managing project lifecycles, and driving successful sprint grooming, planning, and coordination to deliver high-quality solutions that meet business objectives.
Logo of the organization.
Deutsche Post DHL
Jaypee Institute of Information Technology
Berlin, Bundesrepublik
德國

專業背景

  • 目前狀態
    就職中
  • 專業
    數據工程師
    大數據開發人員
  • 產業
    大數據
    軟體
    軟體即服務 / 雲服務
  • 工作年資
    10 到 15 年 (10 到 15 年相關工作經驗)
  • 管理經歷
    我有管理 5~10 人的經驗
  • 技能
    Programming - Scala
    SQL
    Python
    Java
    Data - Spark
    Spark Streaming
    Kafka
    Databricks
    Hive
    ETL
    MapReduce
    Sqoop
    Cloud - Azure
    AWS
    Cloudera Enterprise
    Devops - Azure devops pipelines
    Git
    Docker
    Kubernetes
    Jenkins
    Tools - IntelliJ
    Eclipse
    IBM Cognos
    Microsoft SQL Server
    Oracle
    Visual studio
    Orchestration - Airflow
    Oozie
    Development Methodologies - AGILE
    Azure Cloud Services
    Azure
    Azure DevOps
    DataBricks
    Databricks SQL
    DataBricks • Data Engineering and ETL Tools: SSIS
    pyspark
    fabric
    Microsoft Fabric
  • 語言能力
    English
    專業
  • 最高學歷
    大學

求職偏好

  • 目前狀態
  • 預期工作模式
    全職
    對遠端工作有興趣
  • 希望獲得的職位
    Cloud Solution Architect
  • 期望的工作地點
    Kreisfreie Stadt Frankfurt am Main, Darmstadt, Hessen, Deutschland
    Berlin, Bundesrepublik
    Muenchen, Regierungsbezirk Oberbayern, Bayern, Deutschland
    United States
  • 接案服務
    兼職接案者

工作經驗

Logo of the organization.

Senior Data Engineer

2022年7月 - 現在
Berlin, Bundesrepublik
Engaged in a pivotal role within the E-commerce US Data Lake project, dedicated to crafting a centralized data lake platform on Azure. The primary goal is to seamlessly ingest data from diverse sources, encompassing both batch and real-time data, and empower business stakeholders across departments to construct data warehouses. The project, labeled ECS US Data Lake, involves the following responsibilities: - Facilitating effective communication with business stakeholders across various departments to comprehend requirements and orchestrate design discussions with the team. - Orchestrating the setup of Infrastructure Platform components and implementing security measures using automation tools, specifically Terraform. - Devising versatile Azure Data Factory pipeline templates to streamline the ingestion and consolidation of data from numerous source systems, spanning both batch and real-time environments. - Crafting and implementing consolidation strategies, along with business models and reports in Databricks using PySpark. - Design discussions with the platform and DevOps engineers to develop CI/CD processes and pipelines Working on Microsoft Fabric Evaluation and Migration Impact Analysis POC for Fabric Data Factory, Lakehouse and One Lake/Direct Lake
Logo of the organization.

Data Engineering Specialist

2021年7月 - 2022年7月
1 年 1 個月
Riga, Latvia
Engaged with a mining industry client, spearheading efforts to efficiently handle and refine data from diverse source systems through Azure Cloud technologies, including Data Factories, pipelines, and Azure Databricks. Project/Product: Centralized Data Provisioning Key Responsibilities: - Lead, mentor, and provide technical guidance to a team of 5 data engineers. - Innovatively designed a streaming ingestion and curation pattern utilizing Azure EventHub services. - Developed an ingestion and curation data pipeline for building a BI analytics-focused data lake. - Engineered an event-based system for consuming and curating data from event grid topics using Azure cloud services.
Logo of the organization.

Senior Data Engineer

2018年1月 - 2021年7月
3 年 7 個月
गुड़गांव जिला, हरियाणा, भारत
Contributed to the modernization of a platform product by leveraging Spark, Scala, and an AWS EC2 cluster for aggregations and transformations. This project focused on standardizing terabytes of incremental Market Ownership dataset for millions of companies. Project/Product - Ownership Incremental Standardization Key Achievements: - Implemented Spark-based parallel ingestion and processing framework, significantly improving performance compared to legacy systems. - Led the end-to-end delivery of a complex system involving multiple frameworks for ingestion, initialization, standardization, and aggregation. - Successfully optimized performance for complex problems, such as hierarchical queries and aggregates, using techniques like caching, repartitioning, and broadcasting. Project/Product - Modernization of Loaders using Kafka Key Contributions: - Executed the implementation of a real-time Kafka pipeline, handling millions of transactional CDC messages and processing them with business logic before persisting in the target database. - Led the end-to-end pipeline delivery, designing and developing the Kafka consumer with adherence to best practices and guidelines. - Achieved performance optimization and addressed latency issues in message consumption through Kafka optimization techniques and Docker in production.
Logo of the organization.

Big Data Consultant

2013年12月 - 2018年1月
4 年 2 個月
गुड़गांव जिला, हरियाणा, भारत
Project/Product - CHIEE (Clinical Health Information Engagement Enablement) - Anthem (Nov 2016 - Jan 2018) - Led the development of a data lake storing clinical healthcare data from wearable devices through a multi-layer framework.Responsibilities included designing a real-time data ingestion and processing framework using Kafka and Spark streaming,building data models for MongoDB, and integrating layers for an end-to-end data lake solution. Project/Product - D-rive Telematics (Usage-Based Insurance) - (Jan 2015 - Oct 2016) - Contributed to a Data Analytics Telematics solution capturing user data through sensors on a mobile app. Responsibilitiesinvolved designing and implementing rules and algorithms using Java MapReduce and Hive scripts, as well as developingOozie workflows. Project/Product - Anthem Healthcare Modernization (Jun 2014 - Dec 2014) - Focused on the ingestion and transformation of healthcare data from Teradata to Hadoop (Hive) in parquet format using SparkScala. Responsibilities included designing and building an ingestion and processing framework.
Logo of the organization.

Systems Engineer

2011年7月 - 2013年12月
2 年 6 個月
Hyderabad, तेलंगाना, भारत
Project/Product: Emerson Process Management (EPM) - Develop framework models and reports containing crucial measures for analyzing business status. These reports will empower management in making strategic decisions for organizational growth. Roles/Responsibilities: - Design and develop DataMarts using a framework manager to meet user requirements. - Implement security at different levels, including data, object, and package. - Design intricate reports and dashboards based on end-user specifications, incorporating features such as drill-through and master-detail functionality.
Logo of the organization.

Systems Engineer Trainee

2011年2月 - 2011年6月
5 個月
Mysore, कर्नाटक, भारत
Completed Microsoft DotNet Framework 3.5 training with a CGPA of 4.78 (5). Created a Windows Phone 7 application, 'EnRAllocation Management System,' using Silverlight and C#.

學歷

學士學位
Bachelor of Technology in Computer Science & Engineering
2007 - 2011
7.2/10 GPA

資格認證

Logo of the organization.

Microsoft Certified: Azure Data Engineer Associate

Microsoft
2023年12月 到期
Logo of the organization.

AWS Certified Cloud Practitioner

Amazon Web Services (AWS)
2024年9月 到期

職場能力評價