Home Job Details
N
Information Technology 🏒 Full Time ⭐️ Verified

Future AI Architect (Generative & Agentic Systems)

Nebula AI Systems
San Francisco
Estimated Salary
USD 180.000 – USD 280.000
New
Live Update
4 Juli 2026
Deadline
4 Jul 2027

Job Description

Shape the Future of Intelligence

Nebula AI Systems is at the forefront of the next technological revolution. We are seeking a visionary Future AI Architect to lead the development of autonomous, generative, and agentic AI systems. If you are passionate about pushing the boundaries of machine learning, large language models (LLMs), and autonomous agent frameworks, we want to meet you.

In this role, you will design and deploy the architectural foundations for our next-generation AI agents capable of complex reasoning and autonomous decision-making. You will work in a high-performance environment where innovation is the norm, and your work will directly impact the future of human-machine interaction.

Why Join Us?

  • Work on cutting-edge Agentic AI and LLM infrastructure.
  • Competitive compensation package with equity opportunities.
  • Flexible remote-first policy with state-of-the-art equipment.

Responsibilities

  • Design and architect scalable, high-performance AI systems focusing on Generative and Agentic workflows.
  • Lead the implementation of proprietary Large Language Models (LLMs) and fine-tuning strategies.
  • Optimize inference pipelines for low-latency, high-throughput production environments.
  • Collaborate with cross-functional teams of data scientists, engineers, and product managers.
  • Establish best practices for AI safety, security, and ethical AI deployment.
  • Stay ahead of industry trends in multimodal AI and autonomous agents.

Qualifications

  • PhD or Master’s degree in Computer Science, Mathematics, or a related field (or equivalent extensive experience).
  • 5+ years of experience in machine learning engineering, specifically with deep learning frameworks (PyTorch, TensorFlow).
  • Proven expertise in designing and deploying LLMs, RAG (Retrieval-Augmented Generation), and Agentic workflows.
  • Strong proficiency in Python, distributed systems, and cloud infrastructure (AWS/GCP).
  • Experience with model quantization, optimization, and serving technologies (vLLM, TensorRT, TGI).
  • Excellent problem-solving skills and a track record of delivering complex technical projects.

Required Skills

Python PyTorch TensorFlow Large Language Models (LLMs) Generative AI Agentic AI Machine Learning Engineering Distributed Systems AWS GCP Docker Kubernetes RAG Fine-tuning Model Optimization

Ready to Take This Challenge?

Make sure your resume is ready. Submit your application now before the deadline.

Apply Now

Related Jobs

Similar job recommendations for you

View All