We are looking for a Senior Solutions Architect to design, develop, and scale innovative AI / ML-driven solutions. You will be responsible for architecting highly scalable, low-latency distributed systems optimized for AI / ML workloads. As a key technical leader, you will solve complex challenges, influence next-generation AI / ML infrastructures, and guide cross-functional teams to deliver state-of-the-art solutions for fast-growing startups and enterprise companies.
Be at the forefront of shaping next-generation AI / ML infrastructures, driving solutions for high-impact products across diverse industries. You'll have the opportunity to influence key architectural decisions and enable real-world applications that scale globally, ensuring innovation and efficiency at every step.
you’ll be responsible for—
Driving end-to-end GenAI architecture and implementation :
- Design and deploy multi-agent systems using modern frameworks (LangGraph, CrewAI, AutoGen)
- Architect RAG solutions with advanced vector store integration
- Implement efficient fine-tuning strategies for foundation models
- Develop synthetic data generation pipelines for training and testing
Leading ML infrastructure and deployment :
Design high-performance model serving architecturesImplement distributed training and inference systemsEstablish MLOps practices and pipelinesOptimize cloud resource utilization and costsSet up monitoring and observability solutionsDriving technical excellence and innovation :
Define architectural standards and best practicesLead technical decision-making for AI / ML initiativesEnsure scalability and reliability of AI systemsImplement AI governance and security measuresGuide teams on advanced AI concepts and implementationsOverseeing production AI systems :
Manage model deployment and versioningImplement A / B testing frameworksMonitor system performance and model driftOptimize inference latency and throughputEnsure high availability and fault toleranceFostering collaboration and growth :
Mentor engineering teams on AI architectureCollaborate with stakeholders on technical strategyDrive innovation in AI / ML solutionsShare knowledge through documentation and trainingLead technical reviews and architecture discussionsyou need—
8 years experience in software engineering or architecture, including :
4 years leading cross-functional GenAI / ML teamsProduction experience with distributed AI systemsEnterprise-scale AI architecture implementationTo lead and architect enterprise-scale GenAI / ML solutions, focusing on :
Multi-agent orchestration using LangGraph, CrewAI, and AutoGenWorkflow automation with LlamaIndex, LangChain, and LangFlowAgent coordination using LETTA frameworkIntegration of specialized agents for reasoning, planning, and executionTo design and implement sophisticated AI architectures incorporating :
Advanced RAG systems using :Vector databases (Chroma, Weaviate, Pinecone, Milvus)Hybrid search with BM25 and semantic embeddingsSelf-querying and recursive retrieval patternsFine-tuning strategies for foundation models :PEFT methods (LoRA, QLoRA, Adapter-tuning)Parameter-efficient training approachesInstruction fine-tuning and RLHFMulti-agent frameworks integrating :Tool-use and reasoning chainsMemory systems (short-term and long-term)Meta-prompting and reflection mechanismsAgent communication protocolsExpertise in advance data generation and synthesis :
Synthetic data generation using Arigilla and PersonaHubPrivacy-preserving data synthesisDomain-specific data augmentationQuality assessment of synthetic dataData balancing and bias mitigationTo architect high-performance ML serving infrastructure focusing on :
Model serving platforms (BentoML, Ray Serve, Triton)Real-time processing with Ray, Kafka, and Spark StreamingDistributed training using Horovod, DeepSpeed, and FSDPvLLM and TGI for efficient inferenceIntegration patterns for hybrid cloud-edge deploymentsTo drive cloud architecture decisions across :
Kubernetes orchestration with Kubeflow and KServeServerless ML with AWS Lambda, Azure Functions, Cloud RunAuto-scaling using HPA, KEDA, and custom metricsResource optimization with Nvidia Triton and TensorRTMLOps platforms (MLflow, Weights & Biases, DVC)bonus points for—
Research publications in AI / MLOpen-source project maintenanceTechnical blog posts on AI architectureConference presentationsAI community leadershipwhat you get—
Best in class salary : We hire only the best, and we pay accordingly.Proximity Talks : Meet other designers, engineers, and product geeks — and learn from experts in the field.Keep on learning with a world-class team : Work with the best in the field, challenge yourself constantly, and learn something new every day.about us—
Proximity is the trusted technology, design, and consulting partner for leading startups, fast-growing scale-ups, and global enterprises. We’re headquartered in San Francisco and have offices in Palo Alto, Dubai, Mumbai, and Bangalore. Since 2019, Proximity has created and grown high-impact, scalable products used by 370 million daily users, with a total net worth of $45.7 billion among our client companies.
We are Proximity — a global team of coders, designers, product managers, geeks, and experts. We solve complex problems and build cutting edge tech, at scale. Our team of Proxonauts is growing quickly, which means your impact on the company’s success will be huge. You’ll have the chance to work with experienced leaders who have built and led multiple tech, product and design teams.
You can visit our website Proximity.tech and :
Watch our CEO, Hardik Jagda, tell you all about Proximity.Read about Proximity’s values and meet some of our Proxonauts.Explore our website, blog, and the design wing — Studio Proximity.Get behind-the-scenes with us on Instagram! Follow @ProxWrks and @H.Jagda