Posted on 7th November 2025
Expires on 6th January 2026
Negotiable
我们正在为全球用户构建 AI 系统。当前正处于 AI 变革的关键时期 —— 本项目团队致力于构建能够真正落地、创造现实世界最大影响与使用量的 AI 应用。
该职位为全球岗位,采用灵活混合办公模式 —— 结合远程办公与总部现场协作。你将与产品、工程、运营、基础设施和数据等地区团队紧密合作,共同构建并扩展具有影响力的 AI 解决方案。
你将运行并优化最前沿的开源模型、设计推理框架,并将 AI 功能稳定上线。你的工作将确保我们的模型不仅具备智能,还能在规模化场景中保持安全性、可靠性与性能表现。
熟悉 GPU 调度与资源编排,掌握 Kubernetes、Ray、Modal、RunPod、LambdaLabs 等工具
我们是一支高密度、高绩效的团队,专注于高质量产品与全球影响力。我们像主人一样承担责任,重视速度、清晰与极致执行。如果你渴望成长并追求卓越,欢迎加入我们!
BJAK 是东南亚最大的保险聚合平台,服务用户超过 800 万,且由员工全资持股。公司总部位于马来西亚,在泰国、台湾与日本设有业务。我们通过 Bjak.com">Bjak.com 帮助数百万用户获取透明且可负担的金融保障。
我们通过 API、自动化与 AI 等前沿科技,简化复杂金融产品,致力于打造下一代智能金融系统。
如果你对构建真正落地的 AI 系统充满热情,并希望在高影响力环境中快速成长,我们期待与你相遇!
Transform Language Models into Real-World Applications
We’re building AI systems for a global audience. We are living in an era of AI transition - this new project team will be focusing on building applications to enable more real world impact and highest usage for the world.
This role is a global role with hybrid work arrangement - combining flexible remote work with in-office collaboration at our HQ. You’ll work closely with regional teams across product, engineering, operations, infrastructure and data to build and scale impactful AI solutions.
You’ll fine-tune state-of-the-art models, design evaluation frameworks, and bring AI features into production. Your work ensures our models are not only intelligent, but also safe, trustworthy, and impactful at scale.
Run and manage open-source models efficiently, optimizing for cost and reliability
Ensure high performance and stability across GPU, CPU, and memory resources
Monitor and troubleshoot model inference to maintain low latency and high throughput
Collaborate with engineers to implement scalable and reliable model serving solutions
Believe clarity comes from action - prototype, test, and iterate without waiting for perfect plans.
Stay calm and effective in startup chaos - shifting priorities and building from zero doesn’t faze you.
Bias for speed - you believe it’s better to deliver something valuable now than a perfect version much later.
See feedback and failure as part of growth - you’re here to level up.
Possess humility, hunger, and hustle, and lift others up as you go.
Experience with model serving platforms such as vLLM or HuggingFace TGI
Proficiency in GPU orchestration using tools like Kubernetes, Ray, Modal, RunPod, LambdaLabs
Ability to monitor latency, costs, and scale systems efficiently with traffic demands
Experience setting up inference endpoints for backend engineers
Full involvement in direction and consensus decision making
High-impact role with visibility across product, data, and engineering
Top-of-market compensation and performance-based bonuses
Lots of perks - housing rental subsidies, a quality company cafeteria, and overtime meals
Global travel insurance (for you & your dependents)
We’re a densed, high-performance team focused on high quality work and global impact. We behave like owners. We value speed, clarity, and relentless ownership. If you’re hungry to grow and care deeply about excellence, join us.
BJAK is Southeast Asia’s #1 insurance aggregator with 8M+ users, fully owned by its employees. Headquartered in Malaysia and operating in Thailand, Taiwan, and Japan, we help millions of users access transparent and affordable financial protection through Bjak.com">Bjak.com. We simplify complex financial products through cutting-edge technologies, including APIs, automation, and AI, to build the next generation of intelligent financial systems.
If you're excited to build real-world AI systems and grow fast in a high-impact environment, we’d love to hear from you.