价格方案
Avatar of Shyam Ahuja.
Shyam Ahuja
Senior Data Engineer
Profile
Reputation Credits
0

Posts
0Connections
列印
Avatar of the user.

Shyam Ahuja

Senior Data Engineer
Azure Architect with 14 years of experience in IT services and consulting, specializing in the design and implementation of data-intensive applications leveraging Azure Cloud and Big Data technologies. Proficient in architecting scalable solutions using Azure services such as Azure Data Lake, Azure Synapse Analytics, and Azure Databricks. Skilled in Spark analytics, AWS, Scala, Kafka, Hive, SQL, and Python, with a strong focus on data orchestration and integration. Proven track record in leading agile scrum teams, managing project lifecycles, and driving successful sprint grooming, planning, and coordination to deliver high-quality solutions that meet business objectives.
Logo of the organization.
Deutsche Post DHL
Jaypee Institute of Information Technology
Berlin, Bundesrepublik
德国

专业背景

  • 目前状态
    就职中
  • 专业
    数据工程师
    大数据开发人员
  • 产业
    大数据
    软件
    软件即服务 / 云服务
  • 工作年资
    10 到 15 年 (10 到 15 年相关工作经验)
  • 管理经历
    我有管理 5~10 人的经验
  • 技能
    Programming - Scala
    SQL
    Python
    Java
    Data - Spark
    Spark Streaming
    Kafka
    Databricks
    Hive
    ETL
    MapReduce
    Sqoop
    Cloud - Azure
    AWS
    Cloudera Enterprise
    Devops - Azure devops pipelines
    Git
    Docker
    Kubernetes
    Jenkins
    Tools - IntelliJ
    Eclipse
    IBM Cognos
    Microsoft SQL Server
    Oracle
    Visual studio
    Orchestration - Airflow
    Oozie
    Development Methodologies - AGILE
    Azure Cloud Services
    Azure
    Azure DevOps
    DataBricks
    Databricks SQL
    DataBricks • Data Engineering and ETL Tools: SSIS
    pyspark
    fabric
    Microsoft Fabric
  • 语言能力
    English
    专业
  • 最高学历
    大学

求职偏好

  • 目前状态
  • 预期工作模式
    全职
    对远端工作有兴趣
  • 希望获得的职位
    Cloud Solution Architect
  • 期望的工作地点
    Kreisfreie Stadt Frankfurt am Main, Darmstadt, Hessen, Deutschland
    Berlin, Bundesrepublik
    Muenchen, Regierungsbezirk Oberbayern, Bayern, Deutschland
    United States
  • 接案服务
    兼职接案者

工作经验

Logo of the organization.

Senior Data Engineer

2022年7月 - 现在
Berlin, Bundesrepublik
Engaged in a pivotal role within the E-commerce US Data Lake project, dedicated to crafting a centralized data lake platform on Azure. The primary goal is to seamlessly ingest data from diverse sources, encompassing both batch and real-time data, and empower business stakeholders across departments to construct data warehouses. The project, labeled ECS US Data Lake, involves the following responsibilities: - Facilitating effective communication with business stakeholders across various departments to comprehend requirements and orchestrate design discussions with the team. - Orchestrating the setup of Infrastructure Platform components and implementing security measures using automation tools, specifically Terraform. - Devising versatile Azure Data Factory pipeline templates to streamline the ingestion and consolidation of data from numerous source systems, spanning both batch and real-time environments. - Crafting and implementing consolidation strategies, along with business models and reports in Databricks using PySpark. - Design discussions with the platform and DevOps engineers to develop CI/CD processes and pipelines Working on Microsoft Fabric Evaluation and Migration Impact Analysis POC for Fabric Data Factory, Lakehouse and One Lake/Direct Lake
Logo of the organization.

Data Engineering Specialist

2021年7月 - 2022年7月
1 年 1 个月
Riga, Latvia
Engaged with a mining industry client, spearheading efforts to efficiently handle and refine data from diverse source systems through Azure Cloud technologies, including Data Factories, pipelines, and Azure Databricks. Project/Product: Centralized Data Provisioning Key Responsibilities: - Lead, mentor, and provide technical guidance to a team of 5 data engineers. - Innovatively designed a streaming ingestion and curation pattern utilizing Azure EventHub services. - Developed an ingestion and curation data pipeline for building a BI analytics-focused data lake. - Engineered an event-based system for consuming and curating data from event grid topics using Azure cloud services.
Logo of the organization.

Senior Data Engineer

2018年1月 - 2021年7月
3 年 7 个月
गुड़गांव जिला, हरियाणा, भारत
Contributed to the modernization of a platform product by leveraging Spark, Scala, and an AWS EC2 cluster for aggregations and transformations. This project focused on standardizing terabytes of incremental Market Ownership dataset for millions of companies. Project/Product - Ownership Incremental Standardization Key Achievements: - Implemented Spark-based parallel ingestion and processing framework, significantly improving performance compared to legacy systems. - Led the end-to-end delivery of a complex system involving multiple frameworks for ingestion, initialization, standardization, and aggregation. - Successfully optimized performance for complex problems, such as hierarchical queries and aggregates, using techniques like caching, repartitioning, and broadcasting. Project/Product - Modernization of Loaders using Kafka Key Contributions: - Executed the implementation of a real-time Kafka pipeline, handling millions of transactional CDC messages and processing them with business logic before persisting in the target database. - Led the end-to-end pipeline delivery, designing and developing the Kafka consumer with adherence to best practices and guidelines. - Achieved performance optimization and addressed latency issues in message consumption through Kafka optimization techniques and Docker in production.
Logo of the organization.

Big Data Consultant

2013年12月 - 2018年1月
4 年 2 个月
गुड़गांव जिला, हरियाणा, भारत
Project/Product - CHIEE (Clinical Health Information Engagement Enablement) - Anthem (Nov 2016 - Jan 2018) - Led the development of a data lake storing clinical healthcare data from wearable devices through a multi-layer framework.Responsibilities included designing a real-time data ingestion and processing framework using Kafka and Spark streaming,building data models for MongoDB, and integrating layers for an end-to-end data lake solution. Project/Product - D-rive Telematics (Usage-Based Insurance) - (Jan 2015 - Oct 2016) - Contributed to a Data Analytics Telematics solution capturing user data through sensors on a mobile app. Responsibilitiesinvolved designing and implementing rules and algorithms using Java MapReduce and Hive scripts, as well as developingOozie workflows. Project/Product - Anthem Healthcare Modernization (Jun 2014 - Dec 2014) - Focused on the ingestion and transformation of healthcare data from Teradata to Hadoop (Hive) in parquet format using SparkScala. Responsibilities included designing and building an ingestion and processing framework.
Logo of the organization.

Systems Engineer

2011年7月 - 2013年12月
2 年 6 个月
Hyderabad, तेलंगाना, भारत
Project/Product: Emerson Process Management (EPM) - Develop framework models and reports containing crucial measures for analyzing business status. These reports will empower management in making strategic decisions for organizational growth. Roles/Responsibilities: - Design and develop DataMarts using a framework manager to meet user requirements. - Implement security at different levels, including data, object, and package. - Design intricate reports and dashboards based on end-user specifications, incorporating features such as drill-through and master-detail functionality.
Logo of the organization.

Systems Engineer Trainee

2011年2月 - 2011年6月
5 个月
Mysore, कर्नाटक, भारत
Completed Microsoft DotNet Framework 3.5 training with a CGPA of 4.78 (5). Created a Windows Phone 7 application, 'EnRAllocation Management System,' using Silverlight and C#.

学历

学士学位
Bachelor of Technology in Computer Science & Engineering
2007 - 2011
7.2/10 GPA

资格认证

Logo of the organization.

Microsoft Certified: Azure Data Engineer Associate

Microsoft
2023年12月 到期
Logo of the organization.

AWS Certified Cloud Practitioner

Amazon Web Services (AWS)
2024年9月 到期

职场能力评价