Job Description
We are seeking a visionary Senior Generative AI Engineer to spearhead our next-generation language model initiatives. As we prepare for the technological landscape of 2026, we need an expert who can architect scalable, ethical, and high-performance AI solutions. If you are passionate about the future of Natural Language Processing (NLP) and want to build systems that redefine human-machine interaction, this is your opportunity to lead.
What you will do:
Responsibilities
- Design and deploy scalable Large Language Model (LLM) architectures using Python and PyTorch.
- Implement and optimize Retrieval-Augmented Generation (RAG) pipelines to enhance model accuracy and reduce hallucinations.
- Conduct research on state-of-the-art transformer models and fine-tune them for specific enterprise domains.
- Optimize inference latency and cost-efficiency for high-volume production environments.
- Collaborate with product and data science teams to integrate AI capabilities into existing software ecosystems.
Qualifications
- 5+ years of experience in software engineering, with at least 2 years specializing in Machine Learning or Deep Learning.
- Strong proficiency in Python, C++, or Java and deep understanding of computer science fundamentals.
- Extensive experience with PyTorch, TensorFlow, or JAX.
- Proven track record of deploying models in cloud environments (AWS, GCP, or Azure).
- Experience with vector databases (Pinecone, Milvus) and MLOps pipelines.