Home Job Details
O
Information Technology 🏒 Full Time ⭐️ Verified

Senior AI Engineer (2026 Vision)

OmniStream AI
San Francisco
Estimated Salary
USD 190.000 – USD 260.000
New
Live Update
1 Juni 2026
Deadline
1 Jun 2027

Job Description

Are you ready to define the technology landscape of 2026?


OmniStream AI is seeking a visionary Senior AI Engineer to lead our next-generation generative AI initiatives. As we accelerate towards the future of autonomous agents and hyper-personalized user experiences, we need a technical leader who thrives on complexity and innovation.


Join a team of world-class researchers and engineers dedicated to pushing the boundaries of Large Language Models (LLMs) and Reinforcement Learning. You won't just be maintaining systems; you'll be architecting the core intelligence behind our products.

Responsibilities

  • Architect Next-Gen LLM Pipelines: Design and implement scalable, high-performance systems for training and fine-tuning state-of-the-art generative models.
  • Optimize Inference & Latency: Engineer solutions to reduce token generation latency and optimize model resource utilization for real-time applications.
  • Build Retrieval-Augmented Generation (RAG) Frameworks: Develop robust systems to connect proprietary data with LLMs, ensuring factual accuracy and context-aware responses.
  • Research & Development: Stay at the forefront of AI trends, experimenting with novel architectures (e.g., Mixture of Experts, Transformer variants) to drive competitive advantage.
  • Collaborate Across Disciplines: Partner with product managers and designers to translate complex AI capabilities into intuitive user experiences.
  • Mentorship: Guide junior engineers and data scientists, fostering a culture of technical excellence and continuous learning.

Qualifications

  • Education: Master’s or PhD in Computer Science, Mathematics, or a related technical field.
  • Experience: 5+ years of professional experience in Machine Learning or Artificial Intelligence, with at least 2 years focusing on Generative AI.
  • Programming: Deep expertise in Python, PyTorch, and TensorFlow. Experience with distributed training frameworks (Ray, Horovod) is a plus.
  • Model Mastery: Proven track record of working with large-scale transformer models (GPT, BERT, Llama) and fine-tuning strategies.
  • Problem Solving: Exceptional ability to debug complex systems and optimize performance under tight constraints.
  • Communication: Excellent written and verbal communication skills, capable of explaining technical concepts to non-technical stakeholders.

Required Skills

Python PyTorch TensorFlow Machine Learning NLP LLMs Generative AI Deep Learning Distributed Computing CUDA

Ready to Take This Challenge?

Make sure your resume is ready. Submit your application now before the deadline.

Apply Now

Related Jobs

Similar job recommendations for you

View All