Home Job Details
N
Information Technology 🏒 Full Time ⭐️ Verified

Lead Generative AI Architect

Nebula Nexus AI
San Francisco
Estimated Salary
USD 180.000 – USD 250.000
Live Update
12 Mei 2026
Deadline
12 Mei 2027

Job Description

Are you ready to architect the next generation of artificial intelligence? Nebula Nexus AI is on a mission to redefine human-machine interaction in the year 2026 and beyond. We are seeking a visionary Lead Generative AI Architect to spearhead our R&D initiatives and build the foundational models that will power the future of enterprise.

In this pivotal role, you will bridge the gap between theoretical machine learning research and scalable, production-ready engineering. You will lead a world-class team of data scientists and engineers, defining the technical roadmap for our next-generation Large Language Models (LLMs) and multimodal systems.

Why Join Us?

  • Work at the forefront of the AI revolution with cutting-edge technology.
  • Competitive compensation package and equity options.
  • Flexible remote-first culture with hubs in SF and New York.

Responsibilities

  • Architect & Develop: Design and implement robust, scalable generative AI pipelines and LLM architectures from the ground up.
  • Technical Leadership: Guide the engineering team through the full software development lifecycle, ensuring code quality, scalability, and best practices.
  • Model Optimization: Focus on model inference optimization, quantization, and latency reduction to deploy models on edge devices and cloud environments.
  • R&D Strategy: Conduct cutting-edge research to explore new architectures, including diffusion models and transformer evolutions.
  • Collaboration: Partner with product managers and stakeholders to translate business requirements into technical specifications.

Qualifications

  • Education: Master’s or PhD in Computer Science, Mathematics, or a related field (PhD preferred).
  • Experience: 7+ years of experience in machine learning, deep learning, or software engineering.
  • Technical Skills: Deep expertise in Python, PyTorch, TensorFlow, or JAX. Proven experience building and deploying LLMs (e.g., GPT, LLaMA architectures).
  • Infrastructure: Strong understanding of MLOps, Kubernetes, and cloud infrastructure (AWS/GCP/Azure).
  • Problem Solving: Exceptional ability to solve complex algorithmic problems and optimize system performance under high load.

Required Skills

Python PyTorch TensorFlow LLM Machine Learning Deep Learning MLOps Natural Language Processing Kubernetes Cloud Computing CUDA Hugging Face GenAI

Ready to Take This Challenge?

Make sure your resume is ready. Submit your application now before the deadline.

Apply Now

Related Jobs

Similar job recommendations for you

View All