Preferred Major: Bachelor's degree in Computer Science or equivalent
Experience Required:
At least 3 years of experience in the same field
Minimum Job Requirements:
• Cloud Skills - GCP/OpenShift, Kubernetes (k8s), Docker containers/images
• AI Skills – Model training, testing/evaluation, deployment
• ML/LLMOPs
• LLMs and GenAI core skills – how do LLMs work under the hood, inference mechanics of LLMs/ GenAI
• Inference scaling, distributed computing, inference benchmarking, inference planning for meeting SLAs/SLOs
• GPUs and how to work with them, distributed workloads handling, autoscaling
• NVIDIA NIMs, Huggingface
• NVIDIA Superpods (HPC, slurm, k8s)
• Monitoring, dashboards for LLM/ML workloads and applications
• AI Application Architecture know-how, end to end flows
• DevOps (CI/CD, argoCD, git, Jenkins etc)
• Languages: Python, SQL
Skills
Professionalism of communications and soft skills, good leadership in project management