Job Description
Are you ready to define the next era of artificial intelligence? Nexus Future Labs is seeking a visionary Senior Generative AI Engineer to lead the development of next-generation Large Language Models (LLMs) and generative neural networks. In this pivotal role, you will bridge the gap between theoretical research and production-grade applications, pushing the boundaries of what AI can achieve.
We are looking for a technical leader who thrives in a fast-paced, innovative environment. You will own the architecture of our core AI systems, mentor junior engineers, and collaborate with product teams to deploy AI solutions that impact millions of users worldwide.
Responsibilities
- Model Development: Design, train, and fine-tune state-of-the-art generative models (LLMs, diffusion models) using modern deep learning architectures.
- Architecture & Optimization: Architect scalable inference pipelines and optimize model performance for latency and throughput in production environments.
- RAG Implementation: Develop and deploy Retrieval-Augmented Generation (RAG) systems to improve model accuracy and reduce hallucinations.
- Research & Innovation: Track current research and integrate relevant findings into our production stack.
- Collaboration: Partner with cross-functional teams including product managers, data scientists, and software engineers to deliver high-impact features.
- Mentorship: Guide and mentor a team of junior AI engineers and researchers, fostering a culture of technical excellence.
Qualifications
- Education: Master's or PhD in Computer Science, Mathematics, or a related field with a focus on AI/ML.
- Experience: 5+ years of professional experience in machine learning, deep learning, or natural language processing.
- Technical Skills: Proficiency in Python, PyTorch, and TensorFlow, plus experience with Hugging Face Transformers.
- LLM Expertise: Deep understanding of transformer architectures, attention mechanisms, and fine-tuning methodologies.
- Tools: Experience with cloud platforms (AWS/GCP/Azure), containerization (Docker/Kubernetes), and MLOps tools.
- Communication: Exceptional written and verbal communication skills with the ability to translate complex technical concepts for diverse audiences.