Role Overview:
We are seeking a Senior AI Architect with deep expertise in architecting and
delivering end-to-end AI solutions on the cloud, including hands-on experience in
Generative AI and Large Language Models (LLMs). This role demands a
leader who can translate complex business use cases into scalable AI
architectures, drive delivery across the lifecycle—from data engineering to
deployment and governance—and engage with clients in highly technical
discussions.
You will lead solutioning and implementation of LLM-based applications,
including agent-based systems, prompt engineering, RAG pipelines, and
embedding integrations, with a strong focus on performance, cost-
efficiency, security, and Responsible AI practices.
Key Responsibilities:
Lead end-to-end solution design and architecture for AI/ML and Gen
AI initiatives across hyperscalers (Azure, AWS, GCP).
Translate business requirements into robust AI/ML pipelines, covering
data ingestion, model training, fine-tuning, deployment, monitoring, and
governance.
Architect and deploy LLM-based agents using OpenAI, Azure OpenAI,
Hugging Face, and open-source LLMs.
Implement prompt engineering techniques (zero-shot, few-shot, many-
shot) and RAG pipelines using vector databases (e.g., FAISS, Pinecone,
Weaviate).
Evaluate models for performance, suitability, accuracy vs cost vs
security, and ensure compliance with Responsible AI principles.
Own technical discussions with clients on topics like LLMOps, platform
selection, cost trade-offs, model explainability, and data privacy.
Guide the development and operationalization of Responsible AI
frameworks in high-security, regulated environments.
Drive thought leadership, create Gen AI assets, solution accelerators,
PoVs, and contribute to internal capability building.
Lead and mentor a team of AI architects, engineers, and data
scientists, ensuring delivery quality and innovation.
Oversee AI project delivery, including stakeholder management, scope
alignment, and risk mitigation.
Drive AI delivery for both global and domestic Indian clients, with a
focus on enterprise-scale deployments.
Required Qualifications:
10+ years of experience in designing and delivering enterprise-grade
AI/ML solutions.
2+ years of active experience in Generative AI, with strong exposure to
LLMs, agent architectures, and vector-based search.
Proven track record in delivering AI solutions on Azure, especially using
Azure OpenAI, OpenAI GPT models, and other cloud-native ML tools.
Deep experience with open-source LLMs, Hugging Face, and
integration of custom/tuned models.
Strong programming skills in Python, familiarity with ML frameworks like
PyTorch, TensorFlow, and tools like LangChain, LlamaIndex.
Practical experience implementing AI Governance, ML Ops/LLM Ops,
and Responsible AI standards in real-world deployments.
Proven ability to engage clients on technical strategy, architecture design,
and solution trade-offs across performance, cost, and security.
Experience creating AI/Gen AI thought leadership (whitepapers, blogs,
reusable assets).
Strong leadership in managing cross-functional AI teams and delivering
across geographies, including India-based clients.