Beam is an ultrafast AI inference platform. We built a serverless runtime that launches GPU-backed containers in less than 1 second and quickly scales out to thousands of GPUs. Developers use our platform to serve apps to millions of users around the globe. We're backed by Y Combinator, Tiger Global, and prominent developer-tool founders, including the founder of Snyk and former CTO of GitHub.
About the Role
In this role, you'll build full-stack AI apps with our platform. You’ll build examples, demos, and sharable mini-apps that showcase the most interesting capabilities of AI — and you’ll use our infrastructure to do it. You’ll also optimize inference performance for a wide range of models running on our platform. You will minimize latency, maximize throughput, and experiment to make sure the apps running on our platform have industry-leading performance.
Your work will directly impact millions of users worldwide.
Skills & Experience
- Familiar with modern AI workflows, like ComfyUI and LoRA adaptors for fine-tuning
- Able to ship full-stack web apps, ideally using a modern stack like Python/Django and React/Next.js
- You’ve built something from scratch in the past, from wireframes to launching it publicly
- Interest in the modern AI/ML landscape: you’re experimenting with the newest models as soon as they’re released
- Experience with the modern inference stack (e.g., PyTorch, TensorRT, vLLM)
Benefits
- Work on challenging and impactful engineering problems
- Competitive salary and meaningful equity
- Join a fast-growing pre-Series A company at the ground floor
- Health, dental, and vision benefits with 100% coverage for employees and 50% for dependents
- Opportunities to participate in events across the cloud-native and AI communities
- Fitness stipend, learning budget, and much more