Job Description
We are looking for a visionary Senior AI/ML Engineer to architect the next generation of generative intelligence systems. As we gear up for our major 2026 product launch, we need a technical leader who can bridge the gap between theoretical research and production-scale deployment. You will define the AI roadmap that will power our enterprise solutions for the next decade.
Why join us?
- Impact: Directly influence the core architecture of the AI models that will define our industry in 2026.
- Equity: Competitive stock options package for early contributors.
- Environment: Work in a state-of-the-art facility in the heart of San Francisco's tech district.
If you are passionate about Large Language Models (LLMs), computer vision, and building systems that scale, we want to meet you.
Responsibilities
- Architect and train proprietary Large Language Models (LLMs) with a focus on efficiency and accuracy.
- Optimize inference pipelines to support high-volume, low-latency production environments.
- Collaborate with cross-functional teams to integrate AI models into user-facing products.
- Mentor junior engineers and data scientists, fostering a culture of technical excellence.
- Stay ahead of 2026 AI trends, researching and evaluating new architectures like Mamba or Vision Transformers.
- Implement robust MLOps pipelines for continuous training and deployment.
Qualifications
- Masterβs or PhD in Computer Science, Mathematics, or a related quantitative field.
- 5+ years of professional experience in AI/ML engineering, with at least 2 years focused on Generative AI.
- Deep proficiency in Python, PyTorch, or TensorFlow.
- Experience with cloud infrastructure (AWS, GCP, or Azure) and containerization (Docker, Kubernetes).
- Strong understanding of NLP, LLM fine-tuning, and RAG (Retrieval-Augmented Generation).
- Proven track record of deploying machine learning models to production at scale.