Avatar of ithrael w.
ithrael w
devops/SRE
ProfileResume
Posts
2Connections
Print
Avatar of the user.

ithrael w

devops/SRE
Github: https://github.com/Ithrael ● Experience in architecting products with hundreds of millions of users or millions of daily active users, with familiarity in designing, refactoring, and optimizing high-load systems. ● Passionate about technology, with a keen interest in the development of new technologies. Actively involved in the open-source community, contributing to well-known projects like k8s/helm/exporter. A member of the Kubernetes organization and ranked 16th in contributions to the Helm project over the past two years. ● Proficient in public cloud platforms (Aliyun/AWS) with extensive experience using Kubernetes, and expertise in Kubernetes architecture and operations. Skilled in using tools and technologies such as Kubernetes, Docker, Helm, Prometheus, and Grafana. ● Experienced in managing teams of up to 10 people, with a focus on enhancing team effectiveness and execution. ● In-depth understanding and rich experience in cybersecurity. Recognized as a top white-hat hacker with notable achievements: 4-time Apple SRC Hall of Fame, 1st in Intsig SRC, 1st in Longfor SRC, 3rd in SF Express SRC (2020), 2nd in Huoxian SRC, 4th in Wanmei SRC, 2nd in Momo SRC (2023), and 2nd in ZTO SRC (2019).
Nanjing Beiwan Information Technology Co., Ltd
China University of Mining and Technology

Professional Background

  • Current status
    Employed
    Ready to interview
  • Profession
  • Fields
  • Work experience
    6-10 years relevant
  • Management
  • Highest level of education

Job search preferences

  • Desired job type
    Full-time
    Interested in working remotely
  • Desired positions
    Software Engineer / Backend Engineer
  • Desired work locations
  • Freelance

Work Experience

DEVOPS/SRE

May 2019 - Present
Nanjing Shi, Jiangsu, China
Cost Reduction and Efficiency Improvement ● Cloud Product Downgrade: Managed the configuration usage of cloud services such as MySQL, Redis. Implemented an elastic bandwidth strategy during peak business periods to meet performance needs while reducing costs. ● Node Utilization Improvement: Employed elastic nodes in Kubernetes (k8s) and introduced CronHPA and HPA. Developed automatic resource management using k8s webhooks and Prometheus. ● Effect: Reduced cloud service expenses by 30% year-over-year. Monitoring and Alerts ● Monitoring and Alert System: Set up a monitoring and alert system using Exporter-Prometheus-Grafana-Alertmanager-DingTalk. ● MQ Cluster Setup: Established a self-managed MQ cluster and created SLA. ● ELK Log System Optimization: Enhanced the ELK-based logging system to improve the comprehensiveness and visualization of logs, ensuring comprehensive recording of system operation and anomalies, thus providing better data support and analysis for troubleshooting and problem resolution. ● Application Tiering: Defined application levels and established Standard Operating Procedures (SOPs) for on-call duties, ensuring A/B-level service alert response times within 10 minutes. Service Stability Assurance ● Kubernetes Operations: Managed k8s operations and performed cross-major version upgrades. ● Node Affinity Configuration: Allocated services to different namespaces based on business needs and assigned them to different node pools. ● CronHPA Introduction: Adapted to traffic surges during specific time periods. ● Cluster Backup and Recovery: Used Terraform and Velero for cluster backups and rapid recovery. ● Effect: Reduced the incidence rate of P0/P1 issues by over 20% year-over-year. CICD ● Helm Charts Development: Developed Helm Charts and implemented Helm-based deployment for the entire site to improve system maintainability and scalability. ● GitLab Optimization: Enhanced the GitLab release process to increase image build and service deployment speed

Soft Engineer

Sep 2017 - May 2019
1 yr 9 mos
Nanjing Shi, Jiangsu, China
Service Deployment/Operations Built front-end and back-end services from scratch on the Alibaba Cloud platform. Managed the entire process of development, deployment, operations, and go-live to ensure system high availability and stability. Enterprise Data Scraping Responsible for collecting and scraping various types of enterprise data using technical methods. Encountered and bypassed common anti-scraping techniques (such as Ruishu, slider, SVG anti-scraping, etc.) to provide comprehensive data support for business decisions and analysis.

Software Engineer

Mar 2016 - Sep 2017
1 yr 7 mos
Nanjing Shi, Jiangsu, China
Distributed Web Scraper Development and Deployment Designed and developed a distributed web scraping system based on Jenkins, handling the overall architecture and development of data collection. Implemented automated deployment with Jenkins to ensure system stability and continuous data collection. Internet Data Collection and Scraping Responsible for collecting and scraping various types of internet data, including text, images, and videos. Utilized technical methods to obtain diverse data types, providing comprehensive data support for business decision-making and analysis.

Education

Bachelor’s Degree
Computer Science and Technology
2012 - 2016