Cake Job Search

Advanced filters
Off
Taipei City, Taiwan
Full-time
Mid-Senior level
為什麼大家喜歡在 RichWell Co.Ltd. 上班? 1.彈性上班-早上不趕打卡,想多睡一點、避開通勤人潮都OK。2.特休多多-不用等滿一年就能休假,我們比法規更大方,放假就是要爽爽的。3.獎金福利讚 年終、績效獎金該有的都有,努力絕對不白費。4.生日小驚喜,公司記得你的每個重要時刻。5.定期聚餐/Team Building 不只是工作夥伴,更是一起成長的戰友,吃吃喝喝感情更緊密。6.技術課、內部分享會,想學什麼我們都支持,讓你持續進化不退化! About the roleWe are building a reliability-first platform. Over the next 12 months, we will stabilize our Windows-based services, strengthen observability, and progressively containerize into Kubernetes. You will be a key contributor driving self-service operations and data-driven reliability across the stack. What you’ll do• Operational automation: Build self-service runbooks for Windows services (AWX/Rundeck), implement Ansible/PowerShell DSC workflows, health checks, and safe rollbacks implementations.• Observability: Standardize metrics/logs/traces (Prometheus/Grafana, windows_exporter, OpenTelemetry; ELK/Loki). Create golden-signal dashboards and actionable alerts.• Reliability engineering: Participates in on-call, handle incidents and post-incident reviews (PIR), and lead game days to institutionalize SOPs.• Resilience: Design and implement backup disaster recovery, capacity planning, and performance tuning.• Long-term: Drive service containerization and Kubernetes adoption (Helm/Kustomize, Argo CD/Flux, ConfigMap/Secrets) with a strong focus on security and compliance.
Windows Server
Site Reliability Engineer
Prometheus/Grafana
1.6M ~ 2.2M TWD / year
4 years of experience required
No management responsibility
Established in 1987 and headquartered in Taiwan, TSMC pioneered the pure-play foundry business model with an exclusive focus on manufacturing its customers’ products. As of 2024, TSMC serves more than 500 customers and manufactures over 11,000 products for high-performance computing, smartphones, the Internet of Things (IoT), automotive, and digital consumer electronics. It is the world’s largest provider of logic ICs, with an annual capacity of 16 million 12-inch equivalent wafers. TSMC operates fabs in Taiwan as well as manufacturing subsidiaries in Washington State, Japan and China, and the Company began construction on a specialty technology fab in Dresden, Germany, in 2024. In Arizona, TSMC is building three fabs, with the first starting 4nm production in 2025, the second by 2028, and the third by the end of the decade. We are seeking outstanding engineers to join TSMC IT infrastructure team to build and operate IT advanced Data Center to support world-class semiconductor foundry.This team is responsible for designing, implementing and optimizing IT infrastructure towards software-defined computing, storage and networking with advanced cloud technologies.The successful candidate should have strong technical skills and dedication for operation excellence. Responsibilities: Your responsibilities include:1. Network/Storage Design and Management:(1) Design, construction, operation, and capacity planning of large-scale NAS storage/object storage. (2) Operate and manage network infrastructure, including LAN, WLAN, Firewall, and Proxy.(3) Design, implement, and manage scalable network architecture aligned with business goals and industry best practices. 2. Automation and Scripting:(1) Develop and maintain automation scripts for network configuration, monitoring, and management using tools like Ansible and Python. (2) Transform repeatable tasks into automation tools to streamline operations and maximize efficiency. 3. Application Development:(1) Develop state-of-the-art applications and refactor existing applications for improved performance and maintainability.(2) Write and implement tests (unit/feature/integration) to guarantee software integrity. 4. Monitoring and Troubleshooting:(1) Implement monitoring solutions to proactively identify and resolve network and application issues. (2) Perform root cause analysis and corrective actions to troubleshoot technical challenges, including Linux-related systems and logs (e.g., Go code/log analysis). (3) Infrastructure operation issues(network/server/storage/security) visualization and countermeasure planning. Additional information for the job:Job Location: Hsinchu Site, Taichung Site, Tainan Site, Taipei Office (Experienced Only)On-call needs: On-call 1 week every 3 months 1. Manager interview2. Hackerrank test3. On-site personality and English test(which could be replaced if you have script of an official English test)4. HR interview5. Second manager interview (Optional assessment)6. Technical review (Optional assessment)
台灣新竹市新竹
Negotiable
No requirement for relevant working experience
No management responsibility
Established in 1987 and headquartered in Taiwan, TSMC pioneered the pure-play foundry business model with an exclusive focus on manufacturing its customers’ products. As of 2024, TSMC serves more than 500 customers and manufactures over 11,000 products for high-performance computing, smartphones, the Internet of Things (IoT), automotive, and digital consumer electronics. It is the world’s largest provider of logic ICs, with an annual capacity of 16 million 12-inch equivalent wafers. TSMC operates fabs in Taiwan as well as manufacturing subsidiaries in Washington State, Japan and China, and the Company began construction on a specialty technology fab in Dresden, Germany, in 2024. In Arizona, TSMC is building three fabs, with the first starting 4nm production in 2025, the second by 2028, and the third by the end of the decade.As a platform engineer, you will focus on designing, implementing, and maintaining scalable features and services on the platform to support the productionization of applications that supportthe company’s RD/Fab/Business/IT/Security functions to improve the productivity and work quality. Responsibilities: Your responsibilities include:1. Automation and Scripting(1) Develop and maintain automation scripts for configuration, monitoring, and management using tools such as Ansible and Python.(2) Transform repeatable tasks into automation tools to streamline operations and maximize efficiency.(3) Implement Infrastructure as Code (IaC) to automate resource provisioning and CI/CD workflows. 2. Application Development (1) Develop scalable cloud-native microservice architectures for IT applications.(2) Develop state-of-the-art applications and refactor existing ones to improve performance and maintainability.(3) Apply software design principles, such as 12-factor app, to ensure sustainability and quality.(4) Write and implement tests (unit/feature/integration) to guarantee software integrity. 3. Monitoring and Troubleshooting (1) Implement monitoring solutions to proactively identify and resolve network and application issues.(2) Conduct root cause analyses and apply corrective actions to troubleshoot technicalchallenges, including Linux-related systems and logs (e.g., Go code/log analysis).(3) Lead evaluation and adoption of new IT technologies for continuous improvement. 4. (Optional) Network Design and Management (1) Operate and manage network infrastructure, including LAN, WLAN, Firewall, and Proxy.(2) Design, implement, and manage scalable network architecture aligned with business goals and industry best practices. Additional information for the job: Job Location: Hsinchu Site, Taichung Site, Tainan Site, Taipei Office (Experienced Only)On-call needs: On-call 1 week every 3 months The complete interview process includes: 1. Manager interview2. Hackerrank test3. On-site personality and English test (which could be replaced if you have script of an official English test)4. HR interview5. Second manager interview (Optional assessment)6. Technical review (Optional assessment)
Negotiable
5 years of experience required
No management responsibility
【公司介紹】這是一個具國際背景的軟體研發團隊,重視開放溝通、信任與自主文化,讓每位成員都能在彈性的環境中發揮專長並持續成長。團隊運作節奏靈活,鼓勵快速嘗試與學習,在穩定與效率之間取得平衡。 公司提供具競爭力的獎酬制度與完善福利,包含優於法規的休假安排、健康照護支持,以及多元的團隊活動與員工關懷措施。同時也打造舒適的工作環境與日常支持,讓你在專注工作的同時,也能兼顧生活品質與長期發展。【工作內容】 使用 Ansible 建立與維護系統自動化部署與設定管理流程 負責關鍵中介服務(如 Kafka、Tomcat、Redis)之維運、監控與效能優化 協助分析 Java 應用與相關服務異常,進行問題排查與初步定位 撰寫並維護技術文件、操作流程與系統維運手冊 與開發團隊合作,提升系統穩定性與運行效率
Linux
Kafka
Java
800K ~ 1.3M TWD / year
3 years of experience required
No management responsibility
❗️投遞履歷請一律至專屬的職缺網頁:https://25sprout.teamdoor.io/s/ML8ElGFS 目前此職缺為常態徵才,直接透過 Cake平台投遞將不會回覆唷 我們正在尋找一位 Mid-level SRE(Site Reliability Engineer),成為團隊的可靠後盾。你的任務是確保系統穩定運行、雲端環境高效管理、流程持續自動化,讓用戶體驗更順暢、工程師開發更專注。如果你熱愛新技術,喜歡動手解決問題,也樂於與不同角色協作,歡迎加入我們一起:) ▍你的工作將包括: Linux 作業系統管理與維運(RedHat / Debian / Ubuntu 等)網站/應用環境建置與維護(LAMP / LNMP)CI/CD 流程整合與最佳化(Jenkins / GitLab CI/CD)憑證、金鑰與機密管理(SSL/TLS、Vault 等)雲端平台資源管理(AWS EC2 / S3 / RDS、Azure、GCP 等)建置監控與告警系統,確保服務高可用性(Prometheus / Grafana / ELK)自動化工具與基礎架構即程式碼導入(Terraform / Ansible / CloudFormation)
60K ~ 70K TWD / month
3 years of experience required
No management responsibility
We are seeking a Site Reliability Engineer (SRE) specializing in networking. You will design, build, and automate our mission-critical, global-scale network infrastructure. Your mission is to ensure the reliability, performance, and scalability of our network services by applying software engineering principles to solve complex network challenges.【Job Responsibilities】1. Design Implementation: Design, build, and maintain our global core network infrastructure, including data center, cloud, and cross-regional connectivity.2. Automation Orchestration: Develop automation tools and scripts to manage network provisioning, configuration changes, monitoring, and troubleshooting to enhance efficiency and reliability.3. Reliability Performance: Define and monitor Service Level Indicators (SLIs) and Objectives (SLOs). Lead capacity planning and load testing to proactively identify and resolve system bottlenecks.4. Incident Management Improvement: Serve as a point of escalation for complex network failures, lead post-incident reviews, and transform lessons learned into permanent system improvements.5. Collaboration: Work closely with Platform, Product Development, and Security teams to provide a stable, high-performance, and secure network service.【Qualification】1. Bachelor's degree in Computer Science, Electrical Engineering, or a related field, or equivalent practical experience.2. 2+ years of experience in large-scale network management or in a network-focused role.3. A demonstrable portfolio or description of past projects, such as:4. Leading or significantly contributing to large-scale projects like data center network migrations or hybrid-cloud network architecture builds.5. Developing internal network automation systems, configuration management tools, or monitoring/alerting platforms.6. Optimizing network performance or reliability with concrete metrics to demonstrate improvement.【Skills】1. Core Networking:- Deep understanding of TCP/IP, BGP, OSPF, MPLS, VxLAN, and other core network protocols and technologies.- Hands-on experience with mainstream network equipment from vendors like Cisco, Juniper, F5 and Fortinet.2. Automation Programming:- Experience with automation frameworks like Ansible or Terraform.- Proficiency in at least one programming language such as Python or Go.- Familiarity with Git version control.3. Cloud Operating Systems:- Hands-on experience with public cloud networking (AWS, GCP or Azure).- Strong knowledge of Unix operating systems and their network stacks.4. SRE Practices:- Understanding of core SRE principles: SLA/SLO/SLI, error budgets, automation, and toil reduction.- Extensive experience with monitoring and observability tools like Prometheus or Grafana.5. Soft Skills:- Excellent problem-solving and analytical skills.- Strong communication and collaboration skills.- Self-motivated with a passion for learning and adopting new technologies.
SRE
80K ~ 120K TWD / month
3 years of experience required
No management responsibility
About BTSE:彼特思方舟 is a specialized service provider dedicated to delivering a full spectrum of front-office and back-office support solutions, each of which are tailored to the unique needs of global financial technology firms. 彼特思方舟 is engaged by BTSE Group to offer several key positions, enabling the delivery of cutting-edge technology and tailored solutions that meet the evolving demands of the fintech industry in a competitive global market.BTSE Group is a leading global fintech and blockchain company that is committed to building innovative technology and infrastructure. BTSE empowers businesses and corporate clients with the advanced tools they need to excel in a rapidly evolving and competitive market. BTSE has pioneered numerous trading technologies that have been widely adopted across the industry, setting new benchmarks for innovation, performance, and security in fintech. BTSE’s diverse business lines serve both retail (B2C) customers and institutional (B2B) clients, enabling them to launch, operate, and scale fintech businesses. BTSE is seeking ambitious, motivated professionals to join our B2C and B2B teams.About the Opportunity:We are seeking an experienced and highly skilled Senior Network Engineer to design, implement, and maintain enterprise network infrastructure that supports mission-critical business operations. This role requires deep technical expertise in networking technologies, strong problem-solving skills, and the ability to collaborate across teams to ensure secure, reliable, and high-performance connectivity.Responsibilities:Network Design ImplementationArchitect, configure, and deploy LAN, WAN, WLAN, and cloud networking solutions.Lead network upgrades, migrations, and integration projects.Operations MaintenanceMonitor, troubleshoot, and optimize network performance to ensure high availability.Manage switches, load balancers, and other security/network appliances.Maintain accurate network documentation, diagrams, and configurations.Security ComplianceImplement and enforce network security best practices, including segmentation, access control, and threat mitigation.Collaborate with security teams to ensure compliance with regulatory and corporate standards.Collaboration SupportPartner with SRE, IT team, cloud engineers, and application teams to deliver end-to-end solutions.Mentor junior engineers and contribute to knowledge-sharing initiatives.Continuous ImprovementEvaluate emerging technologies and recommend solutions to enhance scalability, resilience, and cost efficiency.Participate in disaster recovery and business continuity planning.Requirement:Bachelor’s degree in Computer Science, Information Technology, or related field (or equivalent experience).7+ years of hands-on experience in enterprise networking roles.Strong expertise in routing, switching, and network protocols (TCP/IP, BGP, OSPF, MPLS, etc.).Proficiency with firewalls, VPNs, and network security technologies ( Palo Alto, Fortinet, etc.).Experience with cloud networking (AWS, Azure, or GCP).Familiarity with network automation and scripting (Python, Ansible, etc.) is a plus.Relevant certifications (e.g., CCNP, CCIE, or equivalent) preferred.Excellent analytical, troubleshooting, and communication skills.Perks BenefitsCompetitive total compensation packageVarious team building programs and company eventsComprehensive healthcare schemes for employees and dependantsAnd many more! Apply and let us tell you more!#LI-JY1
Negotiable
No requirement for relevant working experience
About BTSE: 彼特思方舟 is a specialized service provider dedicated to delivering a full spectrum of front-office and back-office support solutions, each of which are tailored to the unique needs of global financial technology firms. 彼特思方舟 is engaged by BTSE Group to offer several key positions, enabling the delivery of cutting-edge technology and tailored solutions that meet the evolving demands of the fintech industry in a competitive global market. BTSE Group is a leading global fintech and blockchain company that is committed to building innovative technology and infrastructure. BTSE empowers businesses and corporate clients with the advanced tools they need to excel in a rapidly evolving and competitive market. BTSE has pioneered numerous trading technologies that have been widely adopted across the industry, setting new benchmarks for innovation, performance, and security in fintech. BTSE’s diverse business lines serve both retail (B2C) customers and institutional (B2B) clients, enabling them to launch, operate, and scale fintech businesses. BTSE is seeking ambitious, motivated professionals to join our B2C and B2B teams. About the Opportunity: We are looking for a dedicated and experienced Senior Infrastructure Developer to lead the design, implementation, optimization, and operation of our infrastructure environment. This role requires strong hands-on experience in cloud infrastructure, Linux systems, CI/CD, deployment automation, database operations, and low-latency system tuning. You will work closely with software engineers, platform teams, and other technical stakeholders to build and maintain a highly available, secure, scalable, and performance-driven infrastructure platform. This position also requires participation in an on-call rotation to support production systems and handle critical incidents. Candidates with experience in financial systems, trading platforms, digital assets, or cryptocurrency-related industries will be strongly preferred. Responsibilities:Design, build, maintain, and optimize the trading system infrastructure environment. Manage and improve AWS infrastructure, including EC2, networking, security, and OpenSearch. Build and maintain infrastructure automation and Infrastructure as Code practices. Design, implement, and support CI/CD pipelines, including GitLab and GitHub integration. Administer and optimize Linux environments, including system tuning, network tuning, and low-latency optimization. Support deployment, operation, and performance management of microservices. Manage multiple JVM-based services with a focus on performance, stability, and resource efficiency. Operate and optimize database platforms including ClickHouse, DolphinDB, and PostgreSQL. Design, deploy, and maintain the observability stack (Prometheus, Grafana, OpenTelemetry) from scratch to monitor system health, troubleshoot performance bottlenecks, and resolve production issues. Participate in on-call rotation and respond to production incidents in a timely manner. Collaborate with development and platform teams to improve system reliability, observability, and security.Requirement: Solid experience in Infrastructure Engineering, SRE, DevOps, Platform Engineering, or related roles. Expert-level AWS management skills, including: EC2 Networking Security OpenSearch Working experience with DolphinDB and/or PostgreSQL. Must-have experience with ClickHouse. Professional-level CI/CD experience, including: Ansible AWS Artifactory Git-based CI/CD integration using GitLab and GitHub Strong Linux expertise is required, including: Linux system administration Low-latency tuning Network tuning System tuning Proven ability to architect, install, and configure monitoring, alerting, and observability stacks from the ground up, specifically using Prometheus, Grafana, and/or OpenTelemetry. Hands-on experience with microservice management, including management of multiple JVM-based services. Hands-on experience with Infrastructure as Code. Willingness to join an on-call rotation and support production environments. Nice to Have: Experience in financial services, trading systems, capital markets, digital assets, or cryptocurrency-related industries. Familiarity with trading platforms, market data systems, matching engines, or low-latency infrastructure. Experience supporting high-availability, high-throughput, and low-latency production systems. Familiarity with DPDK (Data Plane Development Kit) is a strong plus. Strong experience in production incident handling and root cause analysis. Experience in infrastructure security, access control, and network architecture. What We Look For Strong ownership and the ability to lead infrastructure-related initiatives independently. Excellent troubleshooting and problem-solving skills. Ability to work effectively in a fast-paced and high-pressure environment. Strong focus on reliability, performance, and security. Good communication and cross-functional collaboration skills. Perks BenefitsCompetitive total compensation package Various team building programs and company events Comprehensive healthcare schemes for employees and dependants And many more! Apply and let us tell you more!#LI-JY1
Negotiable
No requirement for relevant working experience
【職位概述 Summary】DevOps 組長將負責領導 DevOps 團隊,主導 CI/CD(持續整合/持續交付)流程的設計、實施與優化。您是確保應用程式從開發、測試到部署和生產環境運行的「穩定性、可靠性(Reliability)與安全性(Security)」的關鍵人物,並負責推動 DevOps 文化在整個工程部門的落地與實踐。【主要職責 Responsibilities】1、團隊領導與管理- 帶領 DevOps 團隊成員,規劃工作項目與技術路線。- 制定 DevOps 流程與標準,推動跨部門協作(開發、測試、運維)。- 指導工程師執行自動化與基礎架構優化。2、CI/CD 流程建置與維運- 設計與實作持續整合、持續部署流程(CI/CD pipelines)。- 整合版本控制(Git)、自動化測試、部署與回滾機制。- 優化部署速度、可靠性與安全性。3、雲端與基礎架構管理- 管理雲端環境(AWS / Azure / GCP)與容器化架構(Docker / Kubernetes)。- 使用 IaC 工具(Terraform / Ansible / CloudFormation)自動化環境部署。- 監控資源使用與成本,確保效能與彈性。4、監控與系統穩定性- 建立系統監控、告警與日誌分析架構(如 Prometheus、Grafana、ELK)。- 主導事故處理(Incident Response)與問題根因分析(Root Cause Analysis)。- 確保服務高可用性(High Availability)與災難復原(Disaster Recovery)計畫。5、資訊安全與合規- 實施 DevSecOps 概念,將安全性融入 CI/CD 流程。- 管理憑證、金鑰與機密資料(Secrets Management)。- 確保系統符合資安政策與法規要求(如 ISO 27001、GDPR)。
1.2M ~ 1.8M TWD / month
5 years of experience required
Managing 1-5 staff
【Company Highlights】 致力於提供高效、創新的解決方案,滿足客戶在科技和商業上的各種需求。憑藉豐富的經驗和專業知識,支持客戶在數字化轉型、 軟體開發、IT基礎設施管理及其他關鍵領域的需求 除了技術精實穩定的團隊,也提供讚讚的福利! 包含:彈性上下班制度、優於法規的特休制度、具競爭力的獎金制度、定期聚餐與團隊活動、貼心的生日與節日驚喜、持續學習與成長支持, etc.【Responsibilities】 主導應用程式部署管線的開發與實作,採用 Infrastructure as Code (IaC) 工具,並著重使用 Ansible、Kubernetes (K8s) 以及透過 Jenkins/ArgoCD 建立 CI/CD 流程。與 CTO 及基礎架構團隊密切合作,制定技術策略,確保平台架構與產品路線圖一致。規劃與優化監控與遙測能力,採用 Prometheus/Grafana 技術堆疊及 OpenTelemetry 標準,確保系統全方位可觀測性。SRE 團隊 的技術藍圖與開發計畫(包含災難復原架構與執行)與公司整體願景與技術規劃對齊。透過自動化推動系統的可持續擴展,並持續優化系統架構,以提升可靠性、交付速度與運行效率。領導與指導 SRE 團隊,建立積極主動、責任感與資源掌控的團隊文化。
SRE
On-Premise
Kubernetes
1.8M ~ 2.5M TWD / year
6 years of experience required
Managing 1-5 staff

Cake Job Search

Join Cake now! Search tens of thousands of job listings to find your perfect job.