【Company Information】
這是一個專注於下一代 AI 運算架構的國際化新創團隊,核心技術圍繞在 GPU 資源優化、分散式運算以及高效能運算平台(HPC)。團隊致力於解決大型 AI 模型在訓練與推論過程中的效能與成本瓶頸,打造更具彈性與效率的運算解決方案。
公司目前正處於快速成長階段,產品已開始進入企業級應用場景,並與多方產業資源建立合作關係,布局未來 AI 基礎設施市場。團隊成員背景多元,涵蓋大型科技製造、雲端平台與 AI 領域,具備從底層硬體到軟體平台的整合能力。
在這裡,你將有機會參與從技術定位、產品落地到市場拓展的關鍵過程,直接影響產品如何被理解、採用並規模化,適合喜歡在 0→1、1→N 階段發揮影響力的人才。
【Responsibilities】AI Infrastructure Container Platform Development
Design and optimize system architecture for AI-driven container platforms, supporting large-scale workloads such as model training, inference, and scheduling
Develop and enhance Kubernetes-based components, including container runtimes, storage systems, and networking layers tailored for high-performance AI environments
Build and improve core platform capabilities such as monitoring, logging, alerting, and auditing to ensure strong observability and system reliability
Distributed Systems Performance Optimization
Contribute to the development of distributed computing components that support heterogeneous workloads and scalable AI infrastructure
Optimize system performance across compute, storage, and networking to improve efficiency and cost-effectiveness
Support the evolution of unified platforms for both training and inference workloads
Platform Stability Product Delivery
Ensure platform stability, scalability, and operational excellence for enterprise-grade AI infrastructure
Collaborate with cross-functional teams to deliver robust and production-ready solutionsContinuously improve system architecture based on real-world usage and performance insights