🍔 深耕北美大型連鎖餐飲市場 🍔 - AI 影像辨識創新 SaaS - DevOps Team Lead - AY

Job updated 3 months ago
The employer was active 8 minutes ago

Job Description

🏢 關於這個團隊

一家專注於 零售/速食餐飲 AI 解決方案的電腦視覺公司。團隊已獲得市場肯定,產品也已在美國大型零售場域實際部署與運行。目前正擴編工程團隊,以提升產品基礎架構與整體能力的規模化發展。

這是一支規模精實、但技術底子非常扎實的團隊,成員多來自大型國際科技公司與 AI 新創。公司背後有穩定的產業支持,同時保有新創該有的決策速度與工程彈性。

【Responsibilities】

Strategy & Ownership

  • Own and drive the DevOps / SRE roadmap, aligning infrastructure priorities with product and business growth.

  • Translate business needs into scalable, reliable, and cost-efficient platform strategies.

  • Define and evolve reliability standards, including SLOs and error budgets.

System Design & Technical Leadership

  • Lead the design of scalable and resilient infrastructure and runtime systems.

  • Drive architectural decisions across services, including CI/CD, deployment workflows, and platform design.

  • Anticipate system risks and design for fault tolerance and long-term operability.

Problem Solving & Execution

  • Work cross-functionally with product, backend, data, and security teams to turn requirements into actionable technical plans.

  • Break down complex or ambiguous problems and make clear, pragmatic decisions.

  • Balance trade-offs between speed, reliability, and system complexity.

Collaboration & Communication

  • Act as a key interface between engineering, product, and business teams.

  • Clearly communicate technical strategies, risks, and priorities to both technical and non-technical stakeholders.

  • Lead incident communication, coordination, and post-incident reviews.

Team & Operational Excellence

  • Mentor and support DevOps / SRE engineers, helping raise overall technical standards.

  • Improve team processes, including on-call practices, incident handling, and operational readiness.

  • Drive continuous improvements through metrics, postmortems, and best practices to ensure system reliability and performance.

Requirements

【Qualifications】

Experience & Background

  • 7+ years of experience in DevOps, SRE, or Platform Engineering roles.

  • Experience leading infrastructure or reliability-related initiatives, with a strong sense of ownership.

  • Proven track record designing and operating production systems at scale, including both cloud and on-premises environments.

  • Hands-on experience managing or building on-prem infrastructure (data center, hybrid setups, or self-hosted environments) is highly preferred.

Technical Expertise

  • Strong fundamentals in Linux systems, networking, and distributed systems design.

  • Solid experience designing systems with a focus on scalability, availability, and fault tolerance.

  • Practical experience with:

    • Cloud platforms (e.g., AWS, GCP)

    • Container ecosystems and orchestration (e.g., Kubernetes)

    • Infrastructure as Code tools (e.g., Terraform, ArgoCD)

    • CI/CD pipeline design and automation

  • Deep understanding of observability practices, including metrics, logging, and tracing.

Reliability & Operations

  • Experience defining and operating services using SLI / SLO / SLA frameworks.

  • Familiar with error budget–driven decision making and reliability trade-offs.

  • Strong experience in incident response, root cause analysis, and system recovery.

  • Ability to perform capacity planning and performance optimization for production systems.

Leadership & Communication

  • Strong ability to translate complex technical topics into clear plans and align cross-functional teams.

  • Comfortable working with engineers, product managers, and business stakeholders.

  • Experience mentoring engineers and contributing to technical direction and decision-making.

  • Able to operate effectively in ambiguous environments and take ownership of outcomes.

Nice to Have

  • Experience operating multi-region or hybrid cloud environments.

  • Exposure to security, compliance, or data protection practices.

  • Experience scaling systems or teams in high-growth environments.

  • Background in backend or distributed system development.


Interview process

若您對有興趣應徵此職位,歡迎直接寄送您的履歷到以下信箱,並註明您要應徵的職位,若合適的話,我會再回信與您安排時間詳聊,進一步分享職缺資訊喔!

Thank You!!

📩 Email: [email protected]

🔎 Line: https://line.me/ti/p/D7mgXr7Kbl

1
7 years of experience required
1,500,000 ~ 2,500,000 TWD / year
Managing 1-5 staff
Partial Remote Work
Personal Invitation Link
This is your personal referral link for job invitation. You'll receive an email notification when someone applied for the position via your job link.
Share this job
People who applied for this job also applied for

About us

🔥This is the Cake Recruitment Consulting official web page🔥

Cake Recruitment Consulting - Executive Search, Contracting, EoR/Payroll

We offer a full range of services, including Executive Search, Contracting, and EoR/Payroll, to meet all your recruitment needs.

Our Special Advantages and Differentiation:

  • Professional Recruiter + Direct Sourcing - All consultants have industry experience. In addition to providing accurate talent search. E.g Identify relevant client’s competitors, and engage in comprehensive talent acquisition, building long-term relationships with candidates, and providing market information to clients.
  • CakeResume AI + Martech Recruitment - By leveraging digital Martech technology and employer branding, the recruitment process is optimized. Through a database of millions of potential candidates who would not normally appear, the number of candidates reached is increased and speeds up the sourcing process to fill the talent gap.

About Cake Recruitment Consulting
Cake is the Leading Asia talent platform. With over millions+ active resumes in our database and still it is increasing every day.

Currently more than 7,000+ clients are using our recruitment services.