Cake 找工作

進階搜尋
Off
- 職務說明 因應未來業務拓展、規劃,嘗試各種資訊解決方案設計、建置與維護 GCP 雲端基礎設施 (GKE, GCE, Cloud SQL 等),並專注於其可靠性、擴展性根據利害關係人或同事提出的問題,提供專業的意見,並協助解決問題與公司內部決策者合作,了解其目標,研究並提供解決方案管理 CDN、DNS 等網路基礎服務,確保外部連線的穩定與快速監控、告警與日誌系統的建置與優化CI / CD 設定與維護新技術研究簡單的電腦設備管理
5萬 ~ 8萬 TWD / 月
需具備 1 年以上工作經驗
不需負擔管理責任
Astera Labs (NASDAQ: ALAB) provides rack-scale AI infrastructure through purpose-built connectivity solutions. By collaborating with hyperscalers and ecosystem partners, Astera Labs enables organizations to unlock the full potential of modern AI. Astera Labs’ Intelligent Connectivity Platform integrates CXL®, Ethernet, NVLink, PCIe®, and UALink™ semiconductor-based technologies with the company’s COSMOS software suite to unify diverse components into cohesive, flexible systems that deliver end-to-end scale-up, and scale-out connectivity. The company’s custom connectivity solutions business complements its standards-based portfolio, enabling customers to deploy tailored architectures to meet their unique infrastructure requirements. Discover more at www.asteralabs.com.We are seeking a skilledSenior DevOps Engineer to join our Silicon Engineering Infrastructure team. In this role, you will be instrumental in building, maintaining, and optimizing cloud-based infrastructure that supports our semiconductor design and verification workflows. You will work closely with silicon engineering teams to ensure reliable, scalable, and efficient compute environments. Key Responsibilities Design, deploy, and maintain cloud infrastructure onAWSto support silicon engineering workloads Manage and optimizeEC2instances,FSxfile systems, and related AWS services for high-performance computing needs Implement and manageAWS ParallelClusterfor provisioning and scaling compute clusters and partitions Troubleshoot and resolve complex infrastructure issues across cloud and on-premises environments Develop automation scripts and Infrastructure-as-Code (IaC) solutions to streamline operations Collaborate with EDA tool administrators and silicon engineers to optimize workflows and resource utilization Monitor system performance, implement alerting, and ensure high availability of critical infrastructure Document processes, runbooks, and best practices for team knowledge sharing What You'll Bring A proactive, self-motivated approach to identifying and solving infrastructure challenges Strong communication skills to collaborate with cross-functional engineering teams Ability to work in a fast-paced environment with competing priorities Passion for automation and continuous improvement Required Qualifications 3+ yearsof hands-on DevOps/Infrastructure engineering experience Strongproblem-solving skillswith the ability to debug complex system issues Solid operational knowledge ofAWS Cloudservices, including: EC2(instance management, AMIs, spot/on-demand strategies) FSx(Lustre/NetApp ONTAP for high-performance storage) VPC, Security Groups, IAM, and networking fundamentals Experience with scripting languages such asPython,Bash, or similar Experience with AI based tools like claude-code or copilot a plus Familiarity with Infrastructure-as-Code tools (Terraform, CloudFormation, or Ansible) Experience with CI/CD pipelines and version control systems (Git) Preferred Qualifications Experience withAWS ParallelClusteror similar HPC cluster management tools Background in theSemiconductor/EDA industrywith understanding of: EDA tool workflows (simulation, synthesis, place route, verification) License management and job scheduling (LSF, Slurm, SGE) Debug scenarios specific to silicon design environments Knowledge of container technologies (Docker, Singularity) Experience with monitoring and observability tools (CloudWatch, Prometheus, Grafana) AWS certifications (Solutions Architect, SysOps Administrator) are a plus Nice to Have Experience with hybrid cloud architectures (on-prem + cloud) Familiarity with cost optimization strategies for large-scale cloud deployments Understanding of security best practices in regulated environments Salary range is USD $148,500 to USD $165,000 at Senior Level. USD $175,500 to USD $195,000 at Senior Level. depending on experience, level, and business need. This role may be eligible for discretionary bonus, incentives and benefits. We know that creativity and innovation happen more often when teams include diverse ideas, backgrounds, and experiences, and we actively encourage everyone with relevant experience to apply, including people of color, LGBTQ+ and non-binary people, veterans, parents, and individuals with disabilities.
🚀 DevOps 雲端工程師|打造工程師最想要的開發流程 不是所有英雄都穿披風,有些拿著 YAML、Terraform 和 GitHub Actions 💻 我們在找的就是能讓服務「穩定得像地心引力」的你 🌍 🛠️ 工作內容維護與優化 CI/CD pipeline,自動部署無痛又穩定架設與維運雲端基礎設施(AWS/GCP/Azure)撰寫與管理 Infrastructure as Code架設系統監控與異常警示,主動防災不等天災協助資安防護與備援策略規劃 🔍 我們希望你具備熟悉 Linux + Docker/Kubernetes了解雲平台生態,擁有實戰經驗更讚熟練 GitOps 精神與 CI/CD 工具鏈願意與開發、產品密切合作解決問題 👀 你可能是這樣的人腦中自帶自動化思維,討厭「手動操作」這四個字🧠解 bug 當拼圖,越難越想解🧩喜歡把混亂流程變成優雅系統🔥永遠在學習新技術,有新工具一定想先試試📚 我們相信,一套順手的開發流程,可以讓團隊效率提升 10 倍。來吧,幫大家從痛苦部署地獄中解救出來,從此專心做更有趣的事! 💡時薪範圍:500元/小時起,有能力爭取,還可以更高 💡工作地點:主要為遠端辦公,實際配合方式會與企業達成共識,彈性十足!快組隊平台連結企業與自由接案者,讓你自由選擇適合的案件,不僅能拓展你的技能,還能輕鬆掌控工作時間!現在就加入我們,解鎖更靈活、更高效的工作模式吧!投遞後,為了希望大家更細節認識我們後再決定是否加入, 我們會先邀請您到線上說明會,聽完確定跟您的工作期待有對齊即可填寫會中的會後表單, 我們審查通過後將帶您進入面試流程 。我們期待與您相遇,共創專業價值!
500 ~ 1500 TWD / 小時
需具備 3 年以上工作經驗
不需負擔管理責任
【Company Information】 這是一個專注於下一代 AI 運算架構的國際化新創團隊,核心技術圍繞在 GPU 資源優化、分散式運算以及高效能運算平台(HPC)。團隊致力於解決大型 AI 模型在訓練與推論過程中的效能與成本瓶頸,打造更具彈性與效率的運算解決方案。 公司目前正處於快速成長階段,產品已開始進入企業級應用場景,並與多方產業資源建立合作關係,布局未來 AI 基礎設施市場。團隊成員背景多元,涵蓋大型科技製造、雲端平台與 AI 領域,具備從底層硬體到軟體平台的整合能力。 在這裡,你將有機會參與從技術定位、產品落地到市場拓展的關鍵過程,直接影響產品如何被理解、採用並規模化,適合喜歡在 0→1、1→N 階段發揮影響力的人才。 【Responsibilities】AI Infrastructure Container Platform Development Design and optimize system architecture for AI-driven container platforms, supporting large-scale workloads such as model training, inference, and scheduling Develop and enhance Kubernetes-based components, including container runtimes, storage systems, and networking layers tailored for high-performance AI environments Build and improve core platform capabilities such as monitoring, logging, alerting, and auditing to ensure strong observability and system reliability Distributed Systems Performance Optimization Contribute to the development of distributed computing components that support heterogeneous workloads and scalable AI infrastructure Optimize system performance across compute, storage, and networking to improve efficiency and cost-effectiveness Support the evolution of unified platforms for both training and inference workloads Platform Stability Product Delivery Ensure platform stability, scalability, and operational excellence for enterprise-grade AI infrastructure Collaborate with cross-functional teams to deliver robust and production-ready solutionsContinuously improve system architecture based on real-world usage and performance insights
Prometheus
Kubernetes
Grafana
100萬 ~ 300萬 TWD / 年
需具備 3 年以上工作經驗
不需負擔管理責任
為什麼大家喜歡在 RichWell Co.Ltd. 上班? 1.彈性上班-早上不趕打卡,想多睡一點、避開通勤人潮都OK。2.特休多多-不用等滿一年就能休假,我們比法規更大方,放假就是要爽爽的。3.獎金福利讚 年終、績效獎金該有的都有,努力絕對不白費。4.生日小驚喜,公司記得你的每個重要時刻。5.定期聚餐/Team Building 不只是工作夥伴,更是一起成長的戰友,吃吃喝喝感情更緊密。6.技術課、內部分享會,想學什麼我們都支持,讓你持續進化不退化! About the roleWe are building a reliability-first platform. Over the next 12 months, we will stabilize our Windows-based services, strengthen observability, and progressively containerize into Kubernetes. You will be a key contributor driving self-service operations and data-driven reliability across the stack. What you’ll do• Operational automation: Build self-service runbooks for Windows services (AWX/Rundeck), implement Ansible/PowerShell DSC workflows, health checks, and safe rollbacks implementations.• Observability: Standardize metrics/logs/traces (Prometheus/Grafana, windows_exporter, OpenTelemetry; ELK/Loki). Create golden-signal dashboards and actionable alerts.• Reliability engineering: Participates in on-call, handle incidents and post-incident reviews (PIR), and lead game days to institutionalize SOPs.• Resilience: Design and implement backup disaster recovery, capacity planning, and performance tuning.• Long-term: Drive service containerization and Kubernetes adoption (Helm/Kustomize, Argo CD/Flux, ConfigMap/Secrets) with a strong focus on security and compliance.
Windows Server
Site Reliability Engineer
Prometheus/Grafana
160萬 ~ 220萬 TWD / 年
需具備 4 年以上工作經驗
不需負擔管理責任
About OptiSigns OptiSigns is a fast-scaling cloud platform powering digital signage for 35,000+ businesses across 100+ countries, with 200,000+ active screens worldwide. Founded in Houston, Texas in 2016 and now expanding aggressively in Asia and Europe, we help companiestransform ordinary screens into powerful, dynamic communication tools. Our Vietnam engineering team is central to our next phase of growth. Why This Role This is not a typical architect role. We are looking to bring on a senior software engineer from Taiwan to relocate to Ho Chi Minh City, Vietnam, and lead our growing engineering hub. This is a hands-on technical leadership role, with real ownership over both system scalability and team development. You will lead by example—mentoring engineers, raising the technical bar, and ensuring the team can move fast while building reliable systems. You will own platform scalability at real-world scale: 100M+ database records TB-scale customer data Rapidly growing global traffic Your mission is to ensure the system scales reliably while enabling the team to ship faster and with confidence. This role includes a full relocation package and offers a global career path, including the opportunity to work from our US headquarters as part of our rotation program. What You’ll Do Architect and scale backend systems handling massive datasets and high concurrencyOwn end-to-end performance, reliability, and scalability across the platformOptimize databases, data pipelines, caching layers, and backend servicesProactively identify and eliminate bottlenecks before they impact customersMentor and grow a high-performing engineering team in Ho Chi Minh City and in Houston TX,USCollaborate directly with leadership on long-term technical strategy and roadmap Why Join OptiSigns High-impact role shaping the architecture of a globally used product (200,000+ screens)Real-world scaling challenges with tangible business impactStrong ownership, high visibility, and direct influence on technical directionOpportunity to build and lead a growing engineering team in VietnamFully supported relocation from Taiwan What We Offer Competitive salary: TWD 2.0M – 3.0M per year Performance bonus: 15–25% based on impactOpportunity for global rotation program (work in our US office after 1+ year of strong performance)Full relocation package (visa sponsorship, flights, Temp housing allowance, onboarding support)13th-month salary, comprehensive health insurance, and standard Vietnam benefits
SRE devops
software engineering
ODM/OEM management
200萬 ~ 300萬 TWD / 年
需具備 8 年以上工作經驗
管理人數未定
About OptiSigns OptiSigns is a fast-scaling cloud platform powering digital signage for 35,000+ businesses across 100+ countries, with 200,000+ active screens worldwide. Founded in Houston, Texas in 2016 and now expanding aggressively in Asia and Europe, we help companies transform ordinary screens into powerful, dynamic communication tools. Our Vietnam engineering team is central to our next phase of growth. Why This Role This is not a typical engineer role. We are looking to bring on a Chinese speaking Embedded Android Lead engineer to relocate to Ho Chi Minh City, Vietnam or Houston TX, USA,and lead our growing engineering hub. This is a hands-on technical leadership role, with real ownership over both system scalability and team developmentand lead firmware and OS layer of our digital signage platform. You will be responsible for building and maintaining a custom Android (AOSP-based) system running on Rockchip devices. This role includes a full relocation package and offers a global career path, including the opportunity to work from our US headquarters as part of our rotation program. What You’ll Do Build and customize Android (AOSP) for embedded devicesWork with BSP and vendor SDKs (Rockchip)Optimize system performance for media playback and multi-displayOwn firmware lifecycle: build, release, maintenanceImplement OTA update systemsWork closely with hardware/ODM teams during bring-upIntegrate drivers, HALs, and system servicesEnable remote diagnostics and debugging Why Join OptiSigns High-impact role shaping the architecture of a globally used product (200,000+ screens)Real-world scaling challenges with tangible business impactStrong ownership, high visibility, and direct influence on technical directionOpportunity to build and lead a growing engineering team in VietnamFully supported relocation from Taiwan What We Offer Competitive salary: TWD 2.0M – 3.0M per year Performance bonus: 15–25% based on impactOpportunity for global rotation program (work in our US office after 1+ year of strong performance)Full relocation package (visa sponsorship, flights, Temp housing allowance, onboarding support)13th-month salary, comprehensive health insurance, and standard Vietnam benefits
System Architecture & Design
SRE devops
Software Engineering
200萬 ~ 300萬 TWD / 年
需具備 8 年以上工作經驗
管理人數未定
我們從計程車叫車 App 出發,55688 App 已突破 720 萬會員、累積超過 100 萬次下載,並維持 4.8 星高評價。隨著服務擴展至快遞、找專家、洗衣等生活服務,我們正朝向能承載高即時流量與高可靠度需求的 Super App 邁進。 目前團隊已具備研發與第一線維運人員,正在建立 SRE(可靠度工程)能力,希望邀請對系統穩定性、工程化改善有熱情的工程師,一起把基礎打好、制度建起來。 一、職務定位 1. 負責維持系統在 7x24x365 營運模式下的穩定性、可用性與可擴展性,透過工程化方式降低事故發生率、縮短復原時間,並建立自動化、標準化的部署與維運流程,使系統能安全、快速、可預期地持續交付。同時與研發工程師密切合作,將穩定度、可維運性與交付能力內建於產品開發流程中。 2. 這是一個 SRE / DevOps 的探索與建設角色(0→1),我們不期待你一來就建立完整 SRE 體系,能與團隊逐步建立可靠度工程的基礎能力與共識。 二、Incident / on-call 分工說明 1. L1(第一線)即時應變:由維運人員負責。 2. 本職位為 L2 on-call 支援角色,專注在可靠度與穩定性。 3. 核心價值在於: * 事後改善。 * 制度建立。 * 用工程方式降低事故發生率與影響範圍。 三、你會做的事(工作內容) (一) SRE(可靠度工程|L2) 1. 與團隊一起盤點關鍵服務,逐步導入服務可靠度目標: * Service Level Agreement * Service Level Objective * Error Budget 2. 協助設計與改善系統架構: * 高可用架構(Load Balancer、Auto Scaling、Failover)。 * 健康檢查與自動復原機制。 3. 進行容量規劃與壓力評估: * Capacity Planning。 * 事前評估壅塞與資源不足風險。 4. 建立與優化可觀測性(Observability): * Metrics(CPU、Memory、QPS、Latency、Error Rate) * Logs(集中化日誌) * Tracing(分散式追蹤) 5. 設計合理告警策略: * 避免大量無效或過度頻繁告警。 * 讓告警更貼近實際風險與業務影響。 6. 參與 L2 on-call 支援: * 協助分析系統性問題與 Root Cause。 * 評估是否需要: a. 回滾版本。 b. 降級服務。 c. 進行跨系統處置。 7. 主導或協助完成 Incident Report 與 Postmortem: * 系統性整理事故過程與影響。 * 將每一次事故轉化為具體改善行動與制度。 * 追蹤改善措施的落實情況。 (二) DevOps 1. 建立與維護 CI/CD Pipeline: * 例如 Jenkins、GitLab CI、GitHub Actions。 * 確保流程穩定、可重複且易維護。 2. 將以下流程自動化,降低人工操作風險: * Build。 * Test。 * Security Scan。 * Deploy。 3. 支援多環境的一致性與部署效率: * Dev 環境。 * Staging 環境。 * Production 環境。 4. 導入 Infrastructure as Code: * 例如 Terraform。 * 提升環境管理與佈署的可重現性與可追蹤性。 5. 建立與完善發布與回復機制 6. 與 QA、RD 協作: * 透過流程與工具設計降低發版風險。 * 在速度與穩定之間取得平衡。 (三) 與研發與維運團隊協作 1. 與 RD 協作,將穩定度與可觀測性納入開發流程,例如: * 設計 Health Check 機制,讓系統狀態可被自動偵測與監控。 * 規劃服務降級與備援設計,確保在部分功能異常時,核心流程仍可運作。 * 持續消除單點故障(SPOF),提升整體架構的高可用性。 2. 提供標準化平台能力,讓各產品團隊能共用: * CI/CD Pipeline 範本。 * 監控標準模組。 * 告警標準規則。 3. 與研發與維運團隊共同建立基礎 SRE 實踐: * Incident handling 流程: a. 通報。 b. 應變。 c. 復原。 * Runbook 撰寫與持續改善: a. 讓常見情境有標準作業手冊可依循。 * 基本 SLO / Error Budget 導入與追蹤。 4. 透過文件、分享與實務協作: * 提升團隊對 SRE 思維與方法的理解。 * 建立跨團隊對穩定度的共同語言與共識。 四、我們期待你具備的條件 (一) 必備條件 1. 3–5 年以上 DevOps 或 SRE 相關實務經驗。 2. 熟悉作業系統與網路基礎: * TCP/IP。 * DNS。 * HTTP。 * Load Balancer 等相關概念。 3. 熟悉至少一種雲端平台: * 例如 GCP 或 Azure。 4. 熟悉容器與編排技術 5. 具備 CI/CD Pipeline 建置或維護經驗。 6. 熟悉或曾接觸 Observability 工具,例如: * Prometheus / Grafana。 * ELK(Elasticsearch / Logstash / Kibana)。 * Datadog。 * OpenTelemetry 等。 7. 能配合 L2 on-call 支援: * 接受輪值制度。 * 願意以工程方式持續降低 on-call 負擔與頻率。 8.具領導資淺同仁、指派工作經驗,協同完成工作內容。 (二) 加分條件 1. 有即時高流量系統經驗(即時服務、電商、金流)。 2. 具效能調校、容量規劃或壓力測試實務經驗。 3. 具雲端或平台資安實務經驗,例如: * 權限設計。 * 資安防護。 * 合規與稽核相關經驗。 這不是一個「只是在前線救火」的職位, 而是一個能與團隊一起把 SRE 能力與制度從 0 建起來的角色。 如果你喜歡把混亂變成秩序、 把事故變成制度、 把人力應變變成工程化改善, 我們會很期待和你聊聊。
5萬 ~ 8萬 TWD / 月
需具備 5 年以上工作經驗
不需負擔管理責任
▍團隊介紹:你的未來夥伴 我們不只是在修補漏洞,我們是在建構「原生安全」的產品環境。資安工程師是我們技術團隊中的特種部隊,負責在駭客行動前找出弱點,並透過工程手段(Engineering)將防禦自動化。我們與 SRE、DevOps 團隊深度整合,共同打造高韌性的雲端架構。 ▍團隊文化:我們如何工作 - 實戰導向(Hands-on Mentality)我們重視程式碼與系統實作,不只是看報表。我們鼓勵同仁參與 CTF 比賽、挖掘漏洞,並將研究成果轉化為公司的防禦工具。- 自動化優先(Security as Code)手動掃描是過去式。我們追求將安全檢測整合進 CI/CD Pipeline,實現自動化 DAST/SAST,讓安全檢查如同單元測試一般自然。- 紅藍軍對抗(Red Blue Teaming)我們維持內部持續性的演練文化。透過紅軍(攻擊)發現盲點,藍軍(防禦)強化監控與回應機制,在真實威脅發生前完成進階部署。- 技術共好與分享(Knowledge Sharing)資安領域變化極快,我們每週舉行技術拆解會,分析最新 CVE 漏洞原理或新型攻擊載體,確保團隊始終站在技術前沿。 ▍關於職務:你將負責的工作內容 1. 滲透測試與弱點分析:執行 Web、API 及雲端架構的滲透測試,並提供具體的程式碼修復建議。2. DevSecOps 流程建置:在開發流程中嵌入安全自動化工具,建立漏洞自動化掃描與管理平台。3. 入侵偵測與應變(IR):設計與維護 SOC 告警規則,主導重大資安事故的技術鑑定與數位鑑識。4. 雲端安全加固:針對 K8s 容器安全、雲端權限(IAM)及網路架構進行深度硬化(Hardening)。5. 安全工具開發:使用 Go 或 Python 開發內部專用的資安輔助工具或自動化檢測腳本。6. 特權帳號管理(PAM):確保所有高權限操作皆有稽核軌跡(Audit Trails)。7. 資安自動化:開發自動化工具時,須確保工具本身的安全性(Secure coding for security tools)。 ▍我們提供的福利與環境 - 薪資範圍:月薪 NTD $70,000 - $120,000(依技術實力討論,資深者可面議)。- 戰力補助:全額補助資安證照考試費用(如 OSCP 等高階認證)、國內外資安大會門票、AI工具補助。- 工作彈性:遠距工作、彈性上下班。- 資安人專屬:提供高性能開發工作站與測試環境、不定期舉辦內部技術「攻防賽」、定期部門生日會、活動、聚餐。
7萬 ~ 12萬 TWD / 月
需具備 3 年以上工作經驗
不需負擔管理責任
The role works with a high level of autonomy and discretion, assists Team Lead of eCommerce Platform to develop a modern and high-performance eCommerce platform to support business growth and solve business issues, in line with Amway technology strategic direction. Responsibilities include working together with the teammates of product management and experience design to drive eCommerce implementation and continuous evolution, as well as providing strong operational supports to ensure seamless customer experiences.Work in the agile-based team to develop planned enhancement, maintain good communication and relationships with internal and external key stakeholders to ensure the delivery of eCommerce platform that meet business needs in each Sprint. Fully responsible for end-to-end eCommerce platform delivery, which extends to planning, design, development, delivery, and operations.(Job Description)Develops User-centric (Customer, ABO Internal Staff) products and platforms through collaboration with cross-functional teams within technology organization and business functions, including:⚫ Developing a modern and high-performance eCommerce platform with high availability and scalability, providing excellent end-to-end shopping experiences, covering online and offline channels, through the updated development framework to fulfill commercial growth plan⚫ Comfortable working in an environment that practices agile development, engaging Product Management, Experience Design, the other Technology and Business Stakeholders⚫ Engage in non-functional requirements like scalability, security, stability, and performance for the platform and collaborate with Security, DevOps, and QA teammates to achieve platform goal.⚫ Familiar with Domain-Driven Design (DDD) and microservice design patterns such as CQRS for technical design⚫ Experience with AWS, containerized deployments using Kubernetes/Docker, and working in a DevOps environment, backend REST API development⚫ Guild and design in adopting Microservices, Middleware Container architecture for platform building⚫ Work with the solution architect to establish an engineering process with best practice in Design, Planning, Code, Test, Release and Monitoring⚫ Experience in Java and Spring framework and develop codes according to best practices for software development⚫ Responsible for development management, design, and code reviews⚫ Contribute in optimizing design, code, and defining Unit and Functional test automation strategy and framework adoption⚫ Good experience in debugging, understanding and application of data structures, ability to quickly read through others code⚫ Guide in direct root cause analysis of critical business and production issues⚫ Establish and induct secure coding practices using OWASP and ensure zero vulnerability⚫ Establish monitoring strategy principles in line with product usage bottlenecks, identify solve tech debts with an actionable plan⚫ Bring innovative ideas for platform continuous enhancement.⚫ Align solutions with the overall business applications and technology roadmap⚫ Perform other related duties as assigned.⚫歡迎身心障礙者
80萬 ~ 110萬 TWD / 年
需具備 10 年以上工作經驗
不需負擔管理責任

Cake 找工作

加入 Cake 社群,搜尋上萬筆職缺,快速找到適合你的工作。