Cari Lowongan

Advanced filters
Off
Level Menengah-Senior
為什麼大家喜歡在 RichWell Co.Ltd. 上班? 1.彈性上班-早上不趕打卡,想多睡一點、避開通勤人潮都OK。2.特休多多-不用等滿一年就能休假,我們比法規更大方,放假就是要爽爽的。3.獎金福利讚 年終、績效獎金該有的都有,努力絕對不白費。4.生日小驚喜,公司記得你的每個重要時刻。5.定期聚餐/Team Building 不只是工作夥伴,更是一起成長的戰友,吃吃喝喝感情更緊密。6.技術課、內部分享會,想學什麼我們都支持,讓你持續進化不退化! About the roleWe are building a reliability-first platform. Over the next 12 months, we will stabilize our Windows-based services, strengthen observability, and progressively containerize into Kubernetes. You will be a key contributor driving self-service operations and data-driven reliability across the stack. What you’ll do• Operational automation: Build self-service runbooks for Windows services (AWX/Rundeck), implement Ansible/PowerShell DSC workflows, health checks, and safe rollbacks implementations.• Observability: Standardize metrics/logs/traces (Prometheus/Grafana, windows_exporter, OpenTelemetry; ELK/Loki). Create golden-signal dashboards and actionable alerts.• Reliability engineering: Participates in on-call, handle incidents and post-incident reviews (PIR), and lead game days to institutionalize SOPs.• Resilience: Design and implement backup disaster recovery, capacity planning, and performance tuning.• Long-term: Drive service containerization and Kubernetes adoption (Helm/Kustomize, Argo CD/Flux, ConfigMap/Secrets) with a strong focus on security and compliance.
Windows Server
Site Reliability Engineer
Prometheus/Grafana
1.6 jt ~ 2.2 jt TWD / tahun
Diperlukan pengalaman selama 4 tahun
Tidak ada tanggung jawab manajemen
【Company Highlights】 致力於提供高效、創新的解決方案,滿足客戶在科技和商業上的各種需求。憑藉豐富的經驗和專業知識,支持客戶在數字化轉型、 軟體開發、IT基礎設施管理及其他關鍵領域的需求 除了技術精實穩定的團隊,也提供讚讚的福利! 包含:彈性上下班制度、優於法規的特休制度、具競爭力的獎金制度、定期聚餐與團隊活動、貼心的生日與節日驚喜、持續學習與成長支持, etc.【Responsibilities】 主導應用程式部署管線的開發與實作,採用 Infrastructure as Code (IaC) 工具,並著重使用 Ansible、Kubernetes (K8s) 以及透過 Jenkins/ArgoCD 建立 CI/CD 流程。與 CTO 及基礎架構團隊密切合作,制定技術策略,確保平台架構與產品路線圖一致。規劃與優化監控與遙測能力,採用 Prometheus/Grafana 技術堆疊及 OpenTelemetry 標準,確保系統全方位可觀測性。SRE 團隊 的技術藍圖與開發計畫(包含災難復原架構與執行)與公司整體願景與技術規劃對齊。透過自動化推動系統的可持續擴展,並持續優化系統架構,以提升可靠性、交付速度與運行效率。領導與指導 SRE 團隊,建立積極主動、責任感與資源掌控的團隊文化。
SRE
On-Premise
Kubernetes
1.8 jt ~ 2.5 jt TWD / tahun
Diperlukan pengalaman selama 6 tahun
Mengatur 1-5 staf
【Company Highlights】 🌟 Specializing in AI-driven customer service solutions and virtual assistants, and using natural language processing and machine learning for automated interactions🌟 Aims to enhance customer experience and streamline business operations through AI technology🌟 Fully remote with competitive package and benefits 【Responsibilities】 Design and architect robust, resilient, and scalable OpenStack cloud infrastructure to meet the organization's computing, storage, and networking requirements Lead the deployment, configuration, and integration of core OpenStack services including Nova, Neutron, Cinder, Glance, Keystone, Horizon, and Heat Automate the provisioning and management of OpenStack environments using tools like Ansible, Puppet, or Heat Ensure high availability and fault tolerance across the OpenStack control plane and compute/storage resources Monitor and troubleshoot issues within the OpenStack environment, and implement proactive measures to maintain optimal performance Collaborate with the network, storage, and security teams to integrate OpenStack with existing infrastructure Develop and document standard operating procedures for deploying, upgrading, and maintaining the OpenStack environment Provide technical guidance and support to the cloud operations team Stay up-to-date with the latest OpenStack releases and roadmap, and evaluate newfeatures and capabilities for potential adoption
Logging
Networking Concepts
Architecture
1.5 jt ~ 3 jt TWD / tahun
Diperlukan pengalaman selama 3 tahun
Tidak ada tanggung jawab manajemen
Established in 1987 and headquartered in Taiwan, TSMC pioneered the pure-play foundry business model with an exclusive focus on manufacturing its customers’ products. As of 2024, TSMC serves more than 500 customers and manufactures over 11,000 products for high-performance computing, smartphones, the Internet of Things (IoT), automotive, and digital consumer electronics. It is the world’s largest provider of logic ICs, with an annual capacity of 16 million 12-inch equivalent wafers. TSMC operates fabs in Taiwan as well as manufacturing subsidiaries in Washington State, Japan and China, and the Company began construction on a specialty technology fab in Dresden, Germany, in 2024. In Arizona, TSMC is building three fabs, with the first starting 4nm production in 2025, the second by 2028, and the third by the end of the decade. We are seeking outstanding engineers to join TSMC IT infrastructure team to build and operate IT advanced Data Center to support world-class semiconductor foundry.This team is responsible for designing, implementing and optimizing IT infrastructure towards software-defined computing, storage and networking with advanced cloud technologies.The successful candidate should have strong technical skills and dedication for operation excellence. Responsibilities: Your responsibilities include:1. Network/Storage Design and Management:(1) Design, construction, operation, and capacity planning of large-scale NAS storage/object storage. (2) Operate and manage network infrastructure, including LAN, WLAN, Firewall, and Proxy.(3) Design, implement, and manage scalable network architecture aligned with business goals and industry best practices. 2. Automation and Scripting:(1) Develop and maintain automation scripts for network configuration, monitoring, and management using tools like Ansible and Python. (2) Transform repeatable tasks into automation tools to streamline operations and maximize efficiency. 3. Application Development:(1) Develop state-of-the-art applications and refactor existing applications for improved performance and maintainability.(2) Write and implement tests (unit/feature/integration) to guarantee software integrity. 4. Monitoring and Troubleshooting:(1) Implement monitoring solutions to proactively identify and resolve network and application issues. (2) Perform root cause analysis and corrective actions to troubleshoot technical challenges, including Linux-related systems and logs (e.g., Go code/log analysis). (3) Infrastructure operation issues(network/server/storage/security) visualization and countermeasure planning. Additional information for the job:Job Location: Hsinchu Site, Taichung Site, Tainan Site, Taipei Office (Experienced Only)On-call needs: On-call 1 week every 3 months 1. Manager interview2. Hackerrank test3. On-site personality and English test(which could be replaced if you have script of an official English test)4. HR interview5. Second manager interview (Optional assessment)6. Technical review (Optional assessment)
台灣新竹市新竹
Negotiable
Tidak ada persyaratan pengalaman kerja terkait
Tidak ada tanggung jawab manajemen
❗️投遞履歷請一律至專屬的職缺網頁:https://25sprout.teamdoor.io/s/ML8ElGFS 目前此職缺為常態徵才,直接透過 Cake平台投遞將不會回覆唷 我們正在尋找一位 Mid-level SRE(Site Reliability Engineer),成為團隊的可靠後盾。你的任務是確保系統穩定運行、雲端環境高效管理、流程持續自動化,讓用戶體驗更順暢、工程師開發更專注。如果你熱愛新技術,喜歡動手解決問題,也樂於與不同角色協作,歡迎加入我們一起:) ▍你的工作將包括: Linux 作業系統管理與維運(RedHat / Debian / Ubuntu 等)網站/應用環境建置與維護(LAMP / LNMP)CI/CD 流程整合與最佳化(Jenkins / GitLab CI/CD)憑證、金鑰與機密管理(SSL/TLS、Vault 等)雲端平台資源管理(AWS EC2 / S3 / RDS、Azure、GCP 等)建置監控與告警系統,確保服務高可用性(Prometheus / Grafana / ELK)自動化工具與基礎架構即程式碼導入(Terraform / Ansible / CloudFormation)
60 rb ~ 70 rb TWD / bulan
Diperlukan pengalaman selama 3 tahun
Tidak ada tanggung jawab manajemen
【Company Hihglights】 致力於提供高效、創新的解決方案,滿足客戶在科技和商業上的各種需求。憑藉豐富的經驗和專業知識,支持客戶在數字化轉型、 軟體開發、IT基礎設施管理及其他關鍵領域的需求 除了技術精實穩定的團隊,也提供讚讚的福利! 包含:彈性上下班制度、優於法規的特休制度、具競爭力的獎金制度、定期聚餐與團隊活動、貼心的生日與節日驚喜、持續學習與成長支持, etc. 【Responsibilities】 運用 SRE 最佳實踐,確保平台基礎架構的高可用性與可擴展性。 建置並維護 Jenkins 與 ArgoCD 的 CI/CD 部署流程,提升交付效率與穩定性。 使用 Ansible 和 Kubernetes 等基礎架構即程式化(IaC)工具進行應用程式部署。 建立以 Prometheus 與 Grafana 為核心的監控與觀測系統,確保系統可視性。 設計並執行災難備援與備份方案,保護關鍵系統安全。 推動系統自動化以支援可持續擴展,提升整體系統效能與開發速度。 負責產品環境的 on-call 支援,快速解決關鍵系統問題。 管理 Windows/Linux 伺服器與網路設定,確保系統穩定運作。 維運網站伺服器環境,如 IIS 與 Nginx。 積極與技術與非技術團隊協作,展現高責任感與主動解決問題的能力。
SRE
K8S
IDC
1 jt ~ 2.5 jt TWD / tahun
Diperlukan pengalaman selama 3 tahun
Tidak ada tanggung jawab manajemen
1. 協助設計、導入及優化 CI/CD pipeline,導入自動化測試與部署,提升交付品質 2. 負責系統部署、流程優化、高可用架構設計,確保系統維持高可用狀態 3. 系統環境建置與管理,熟悉GCP/AWS平台Kubernetes, Docker 4. 透過 Python、Shell、Ansible 開發自動化腳本完成日常重複性工作,有效提升整體效率 5. 能夠建置並維運 Nagios / Grafana / Loki / Prometheus 等工具,建立即時通報與處置機制,確保系統穩定 6. 分析網路資料傳輸與網路安全架構等特性,以規劃和維護網際網路系統之正常運作 7. 協助建置一般資料保護規則及落實資訊安全機制 8. 針對雲端架構規劃提供技術性建議
資料備份與復原
規劃與管理防火牆
網路規劃管理
Negotiable
Diperlukan pengalaman selama 2 tahun
Tidak ada tanggung jawab manajemen
Established in 1987 and headquartered in Taiwan, TSMC pioneered the pure-play foundry business model with an exclusive focus on manufacturing its customers’ products. As of 2024, TSMC serves more than 500 customers and manufactures over 11,000 products for high-performance computing, smartphones, the Internet of Things (IoT), automotive, and digital consumer electronics. It is the world’s largest provider of logic ICs, with an annual capacity of 16 million 12-inch equivalent wafers. TSMC operates fabs in Taiwan as well as manufacturing subsidiaries in Washington State, Japan and China, and the Company began construction on a specialty technology fab in Dresden, Germany, in 2024. In Arizona, TSMC is building three fabs, with the first starting 4nm production in 2025, the second by 2028, and the third by the end of the decade.As a platform engineer, you will focus on designing, implementing, and maintaining scalable features and services on the platform to support the productionization of applications that supportthe company’s RD/Fab/Business/IT/Security functions to improve the productivity and work quality. Responsibilities: Your responsibilities include:1. Automation and Scripting(1) Develop and maintain automation scripts for configuration, monitoring, and management using tools such as Ansible and Python.(2) Transform repeatable tasks into automation tools to streamline operations and maximize efficiency.(3) Implement Infrastructure as Code (IaC) to automate resource provisioning and CI/CD workflows. 2. Application Development (1) Develop scalable cloud-native microservice architectures for IT applications.(2) Develop state-of-the-art applications and refactor existing ones to improve performance and maintainability.(3) Apply software design principles, such as 12-factor app, to ensure sustainability and quality.(4) Write and implement tests (unit/feature/integration) to guarantee software integrity. 3. Monitoring and Troubleshooting (1) Implement monitoring solutions to proactively identify and resolve network and application issues.(2) Conduct root cause analyses and apply corrective actions to troubleshoot technicalchallenges, including Linux-related systems and logs (e.g., Go code/log analysis).(3) Lead evaluation and adoption of new IT technologies for continuous improvement. 4. (Optional) Network Design and Management (1) Operate and manage network infrastructure, including LAN, WLAN, Firewall, and Proxy.(2) Design, implement, and manage scalable network architecture aligned with business goals and industry best practices. Additional information for the job: Job Location: Hsinchu Site, Taichung Site, Tainan Site, Taipei Office (Experienced Only)On-call needs: On-call 1 week every 3 months The complete interview process includes: 1. Manager interview2. Hackerrank test3. On-site personality and English test (which could be replaced if you have script of an official English test)4. HR interview5. Second manager interview (Optional assessment)6. Technical review (Optional assessment)
Negotiable
Diperlukan pengalaman selama 5 tahun
Tidak ada tanggung jawab manajemen
About BTSE:彼特思方舟 is a specialized service provider dedicated to delivering a full spectrum of front-office and back-office support solutions, each of which are tailored to the unique needs of global financial technology firms. 彼特思方舟 is engaged by BTSE Group to offer several key positions, enabling the delivery of cutting-edge technology and tailored solutions that meet the evolving demands of the fintech industry in a competitive global market.BTSE Group is a leading global fintech and blockchain company that is committed to building innovative technology and infrastructure. BTSE empowers businesses and corporate clients with the advanced tools they need to excel in a rapidly evolving and competitive market. BTSE has pioneered numerous trading technologies that have been widely adopted across the industry, setting new benchmarks for innovation, performance, and security in fintech. BTSE’s diverse business lines serve both retail (B2C) customers and institutional (B2B) clients, enabling them to launch, operate, and scale fintech businesses. BTSE is seeking ambitious, motivated professionals to join our B2C and B2B teams.About the Opportunity:We are looking for a Senior Infrastructure Cloud Engineer to design, build, and maintain robust AWS-based cloud infrastructure that powers our mission-critical systems. You will champion Infrastructure as Code, automation, and modern DevOps practices to deliver scalable, reliable, and secure cloud environments. This is a hands-on technical role where you’ll collaborate across teams to drive operational excellence and innovation in our cloud platform.Responsibilities:Architect Build: Design and implement secure, scalable, and automated AWS infrastructure solutions.CI/CD Excellence: Maintain and enhance CI/CD pipelines using GitLab CI, ArgoCD, and related tools.Observability: Build and manage monitoring and logging stacks (e.g., CloudWatch, EFK, Prometheus, Grafana) to ensure system health and performance.Automation: Automate provisioning and configuration using Terraform/Terragrunt, Ansible, and GitOps workflows.Governance Resilience: Implement cloud governance, backup, disaster recovery, and cost optimization strategies.Collaboration: Partner with application and platform teams to support cloud adoption and ensure high availability.Incident Response: Troubleshoot incidents, perform root cause analysis, and drive continuous improvement.Documentation: Maintain clear, up-to-date architecture, operations, and automation documentation.Requirement:5+ years in cloud engineering, DevOps, or SRE roles.3+ years of hands-on AWS experience.Proficiency with Infrastructure-as-Code tools (Terraform, Ansible).Strong background in containerization and Kubernetes (EKS preferred).Solid experience with CI/CD and GitOps workflows (GitLab CI, ArgoCD).Hands-on experience with observability tools (EFK, Prometheus, Grafana).Strong understanding of cloud networking, IAM, and security best practices.Perks BenefitsCompetitive total compensation packageVarious team building programs and company eventsComprehensive healthcare schemes for employees and dependantsAnd many more! Apply and let us tell you more!#LI-JY1
Negotiable
Tidak ada persyaratan pengalaman kerja terkait
We are looking for an experienced Site Reliability Engineer (SRE) to build and maintain a reliable, scalable, and resilient platform infrastructure. The ideal candidate will have strong expertise in automation, infrastructure as code (IaC), monitoring, and system scalability.【Responsibility】Develop and maintain automated deployment pipelines using IaC tools (e.g., Ansible, Kubernetes, Jenkins, ArgoCD). Implement and manage monitoring and telemetry solutions with Prometheus and Grafana to ensure system visibility and performance optimization. Design and execute Disaster Recovery (DR) and backup strategies to enhance system resilience. Improve system scalability and reliability through automation and proactive performance optimizations. Continuously evolve the infrastructure by identifying and implementing improvements that enhance reliability and deployment speed.
On-Premise
DevOps
SRE
1.2 jt ~ 1.7 jt TWD / tahun
Diperlukan pengalaman selama 3 tahun
Tidak ada tanggung jawab manajemen

Cari Kerja di Cake

Gabung di Cake sekarang! Cari puluhan ribu lowongan kerja untuk mendapatkan pekerjaan idaman.