Overview
Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsoft’s expanding Cloud Infrastructure and responsible for powering Microsoft’s “Intelligent Cloud” mission. SCHIE delivers the core infrastructure and foundational technologies for Microsoft's over 200 online businesses including Bing, MSN, Office 365, Xbox Live, Teams, OneDrive, and the Microsoft Azure platform globally with our server and data center infrastructure, security and compliance, operations, globalization, and manageability solutions. Our focus is on smart growth, high efficiency, and deliver trusted experience to customers and partners worldwide, and we are looking for a Principal Software Engineer based in Taiwan to help achieve that mission.
To achieve this goal, we in the Hardware Health Service team within Azure are responsible for the design, implementation, and operation of global scalable cloud services to monitor the fleet’s hardware health and predict anomalies and pending failures. We focus on delivering the solutions required for our cloud service platforms at the lowest possible cost of ownership (TCO) and providing great customer experience on unreliable hardware.
Azure Hardware Health Service is looking for a Principal Software Engineer to be a part of the fast-paced and exciting business of Azure. This is your chance to be part of the most exciting end-to-end teams within Microsoft. We are looking for a Principal Software Engineer with deep expertise in Cloud Service development to lead the development and architect innovative software-defined solutions that powers Azure and make our world-class cloud infrastructure even better. To be successful in this role, you must have experience delivering quality results to customers, an engineering leadership mindset, an innate aptitude for agility, and proven technical excellence in software engineering at scale.
Responsibilities
Architect, design, and operate large scale, low latency, and high throughput cloud services.Strategically drive and lead highly complex and mission critical solutions that involve multiple Azure Services across global regions.
Define and measure the success/impact of requested analytics reporting features via quantitative measures.
Provide technical leadership in data analysis and feedback integration for product engineering decisions, acting as a Designated Responsible Individual (DRI) for monitoring and restoring system functionality within the Service Level Agreement (SLA) timeframe. Participates in live service operations, and supports telemetry data integration for system behavior insights, with a focus on performance, reliability, and safety.
Direct the identification of dependencies and design documentation for product features, architect system interactions and back-end dependencies, and lead architectural processes.
Guide the production of code to test hypotheses for technical solutions and assist with technical validation efforts. Oversee quality assurance plans, augment test cases, and integrate automation into testing, while defining the implications of security and compliance in system architecture.
Ensures compliance with security, privacy, safety, and accessibility standards, leverages developer tools for code creation and debugging, contributes to automation in production and deployment, and proactively seeks knowledge to improve product availability, reliability, efficiency, and performance at scale.
Collaborate effectively with remote teams in global locations to ensure architectural alignment and feature delivery while mentoring and coaching junior and senior engineers.