為什麼大家喜歡在 RichWell Co.Ltd. 上班?
1.彈性上班-早上不趕打卡,想多睡一點、避開通勤人潮都OK。2.特休多多-不用等滿一年就能休假,我們比法規更大方,放假就是要爽爽的。3.獎金福利讚 年終、績效獎金該有的都有,努力絕對不白費。4.生日小驚喜,公司記得你的每個重要時刻。5.定期聚餐/Team Building 不只是工作夥伴,更是一起成長的戰友,吃吃喝喝感情更緊密。6.技術課、內部分享會,想學什麼我們都支持,讓你持續進化不退化!
About the roleWe are building a reliability-first platform. Over the next 12 months, we will stabilize our Windows-based services, strengthen observability, and progressively containerize into Kubernetes. You will be a key contributor driving self-service operations and data-driven reliability across the stack.
What you’ll do• Operational automation: Build self-service runbooks for Windows services (AWX/Rundeck), implement Ansible/PowerShell DSC workflows, health checks, and safe rollbacks implementations.• Observability: Standardize metrics/logs/traces (Prometheus/Grafana, windows_exporter, OpenTelemetry; ELK/Loki). Create golden-signal dashboards and actionable alerts.• Reliability engineering: Participates in on-call, handle incidents and post-incident reviews (PIR), and lead game days to institutionalize SOPs.• Resilience: Design and implement backup disaster recovery, capacity planning, and performance tuning.• Long-term: Drive service containerization and Kubernetes adoption (Helm/Kustomize, Argo CD/Flux, ConfigMap/Secrets) with a strong focus on security and compliance.