網站可靠性工程師 Site Reliability Engineer(SRE)/DevOps Engineer (AWS, Kubernetes)

Job updated 9 months ago
The employer was active about 21 hours ago

Job Description

SRE 的使命:打造兼具彈性與極致可靠的數位服務

TVBS 的服務架構承載著千萬用戶等級的日常流量,橫跨新聞、影音內容、電商等多個產品線。SRE 團隊是確保這套複雜系統 7x24 穩定運行的核心力量。我們不只是維運基礎設施,更是開發團隊最信賴的夥伴,致力於透過架構優化、流程自動化與 SRE 最佳實踐,在快速迭代的開發節奏中,捍衛服務的可靠性與效能。

我們正在尋找一位具備深厚技術底蘊和自動化思維的 SRE 工程師。您將主導高流量環境下的 Kubernetes 叢集管理、設計並實施 Infrastructure as Code (IaC),並推動監控系統的進化。如果您熱衷於消除系統瓶頸、降低維運成本(Toil Reduction),並相信 automated everything 是未來趨G勢,這裡是您發揮影響力的最佳舞台。

【主要職責】 (Responsibilities)

  1. SRE 實踐與可靠性定義: 主導 SRE 核心實踐導入,包含定義服務等級目標 (SLO)、建立錯誤預算 (Error Budgets),並推動 Postmortem 文化,從根本上提升系統可靠性。
  2. Kubernetes (EKS) 運維與優化: 負責 AWS EKS 叢集的日常管理、效能調校、成本優化與高可用性架構設計,確保容器化應用的穩定運行。
  3. 基礎設施即代碼 (IaC) 與自動化: 使用 Terraform 或 Cloudformation 全面管理雲端資源。開發自動化腳本(Python/Shell)以減少手動操作,提升部署效率與一致性。
  4. 監控系統與可觀測性 (Observability): 建構與維護全面的監控告警系統(如 Prometheus, Grafana, OpenSearch/ELK),確保能即時發現並定位問題,並持續優化可觀測性。
  5. CI/CD 流程優化: 擁有並持續改善 CI/CD Pipeline,與開發團隊協作,加速軟體交付速度並確保部署品質。
  6. 緊急應變與效能調校: 擔任 On-call 輪值,處理線上緊急事件,並針對系統效能瓶頸進行分析與調優。

The SRE Mission: Building Resilient and Scalable Digital Services

TVBS's service architecture supports daily traffic from tens of millions of users across multiple product lines, including news, video content, and e-commerce. The SRE team is the core force ensuring the 24/7 stability of this complex system. We don't just maintain infrastructure; we are the most trusted partners of the development teams. We safeguard service reliability and performance amidst rapid development cycles through architectural optimization, process automation, and SRE best practices.

We are looking for an SRE with deep technical expertise and an automation-first mindset. You will lead the management of Kubernetes clusters in a high-traffic environment, design and implement Infrastructure as Code (IaC), and drive the evolution of our monitoring systems. If you are passionate about eliminating system bottlenecks, reducing operational toil, and believe that automating everything is key to future success, this is the perfect stage for you to make an impact.

Key Responsibilities

  1. SRE Practices and Reliability Definition: Lead the implementation of core SRE practices, including defining Service Level Objectives (SLOs), establishing Error Budgets, and promoting a Postmortem culture to fundamentally enhance system reliability.
  2. Kubernetes (EKS) Operations and Optimization: Manage the daily operations, performance tuning, cost optimization, and high-availability design of AWS EKS clusters to ensure the stability of containerized applications.
  3. Infrastructure as Code (IaC) and Automation: Manage cloud resources comprehensively using Terraform or Cloudformation. Develop automation scripts (Python/Shell) to reduce manual operations and improve deployment efficiency and consistency.
  4. Monitoring Systems and Observability: Build and maintain comprehensive monitoring and alerting systems (e.g., Prometheus, Grafana, OpenSearch/ELK) to ensure real-time issue detection and localization, and continuously improve observability.
  5. CI/CD Pipeline Optimization: Own and continuously improve CI/CD pipelines, collaborating with development teams to accelerate software delivery while ensuring deployment quality.
  6. Incident Response and Performance Tuning: Participate in on-call rotations, handle production emergencies, and conduct performance analysis and tuning for system bottlenecks.

Requirements

【必要條件】 (Must Have Qualifications)

  1. 雲端平台經驗: 具備 AWS 雲端服務的實務經驗,熟悉核心服務(如 EC2, RDS, VPC, ALB, S3)。
  2. 容器化技術: 精通 Docker 容器技術,並具備 Kubernetes (K8s) 叢集管理與維運的深入經驗(EKS 經驗者佳)。
  3. Infrastructure as Code (IaC): 熟悉 Terraform 或 Cloudformation 等 IaC 工具,並有實際應用於生產環境的經驗。
  4. 自動化腳本能力: 熟悉 Shell Script,且精通至少一種後端程式語言(Python 優先)。
  5. 監控工具實務: 具備建置或維護監控系統的經驗(如 Prometheus, Grafana, ELK stack/OpenSearch)。
  6. 系統與網路基礎: 熟悉 Linux 系統操作與問題排查,並具備紮實的網路知識 (TCP/IP, DNS, Load Balancing)。

【加分條件】 (Nice to Have Qualifications)

  1. 具備多雲平台經驗(GCP, Azure)。
  2. 對網路安全、DevSecOps 流程有實際經驗或濃厚興趣。
  3. 具備高流量網站架構設計或效能調校經驗。
  4. 熟悉 GitOps 流程與相關工具(如 ArgoCD, Flux)。

Must-Have Qualifications

  1. Cloud Platform Experience: Hands-on experience with AWS cloud services, with strong knowledge of core components (e.g., EC2, RDS, VPC, ALB, S3).
  2. Container Technologies: Proficiency in Docker container technology and deep experience in managing and operating Kubernetes (K8s) clusters (EKS experience preferred).
  3. Infrastructure as Code (IaC): Practical production experience with IaC tools such as Terraform or Cloudformation.
  4. Automation and Scripting Proficiency: Proficiency in Shell scripting and at least one backend programming language (Python preferred).
  5. Monitoring Tool Experience: Experience building or maintaining monitoring stacks (e.g., Prometheus, Grafana, ELK stack/OpenSearch).
  6. System and Networking Fundamentals: Strong understanding of Linux system administration, troubleshooting, and solid networking knowledge (TCP/IP, DNS, Load Balancing).

Nice-to-Have Qualifications

  1. Experience with multi-cloud environments (GCP, Azure).
  2. Practical experience or strong interest in network security and DevSecOps practices.
  3. Experience designing or tuning architecture for high-traffic websites.
  4. Familiarity with GitOps principles and tools (e.g., ArgoCD, Flux).
1
1 years of experience required
50,000 ~ 100,000 TWD / month
Partial Remote Work
Meet the Hiring Team
Avatar of the user.
經理
Personal Invitation Link
This is your personal referral link for job invitation. You'll receive an email notification when someone applied for the position via your job link.
Share this job
People who applied for this job also applied for
Full-time
Entry level
1
50K+ TWD / month
Full-time
Mid-Senior level
3
900K ~ 2.2M TWD / year
Full-time
Entry level
1
40K ~ 75K TWD / month
Full-time
Mid-Senior level
1
Negotiable
Full-time
Mid-Senior level
1
1M ~ 2M TWD / year

About us



定位品牌為「科技媒體」,從強化內容出發,輻射出各種經紀、電視、網路節目、電商等業務,創造出最頂尖品牌。

我們正在「世界翻轉中」
TVBS是台灣第一個衛星電視頻道,1993年9月28日正式發聲,我們的出現,結束了由無線三台壟斷數十年的局面,帶領台灣進入一個多元自由、百家爭鳴的融媒體新時代!

我們是「看板人物」
1.探索真相,發揮影響力
關心每天世界上發生的大小事件,透過專業、生動的文字與畫面,帶領觀眾探索真相。
2.匯聚新知,話題不冷場
提供最新最夯的話題,橫跨電視與網路,打造24小時不冷場的話題新平台。
3.散播歡樂,舒展身心靈
用歡樂幫觀眾做心靈馬殺雞,趕走煩悶,散播愉快好心情。
4.領先技術,媒體新視界
從衛星到網路,從類比到HD,從電視到手機,搶先業界求新求變,持續呈現最高品質。
5.點燃創意,碰撞新火花
燃燒腦力,打造創意好點子,源源不斷的新想法,就是我們每天的活力來源。
6.串連資源,服務最即時
串起緊密連結,成為前線同仁堅強的後盾。

我們重視的「T觀點」
真實Truth、信賴Trust、科技Technology
「持續運用最領先的科技、提供最真實的資訊,成為最受信賴的品牌」

我們要「一步一腳印 發現新台灣」 
We Lead The Market 與時俱進,以最新科技與專業維持媒體領導品牌的地位 
We Are The Consciousness Of The Society 導正台灣媒體生態,真實公正報導,堅守社會教育者的良知良能
We Care About This Land And The People Live On It 善用媒體資源,善盡社會責任,關懷生態文明與永續發展 

我們的「十點不一樣」
立足亞洲市場,運用全球思維,延伸媒體觸角。以多角佈局,結合旗下IP品牌,打造兼具新聞報導、影視娛樂、電商通路、演藝經紀和生技產品等於一身的全方位媒體。