Principal Deep Learning Comm. Architect

Job Description

Job Title: Principal Deep Learning Communication Architect. Location: Santa Clara, CA, USA; Austin, TX, USA; Remote (USA)

Responsibilities:

  • Define the long-term technical roadmap for communication libraries across next-generation platforms.
  • Enable seamless scaling of AI models to clusters with hundreds of thousands of nodes.
  • Lead the design and development of next-generation communication primitives and collective algorithms.
  • Optimize communication for heterogeneous interconnects such as NVLink, Ethernet, and InfiniBand;
  • Collaborate with application developers to design specialized communication primitives for AI and HPC workloads.
  • Ensure AI and HPC libraries such as NCCL, NVSHMEM, UCC, and UCX evolve to support large-scale models and advanced AI systems.
  • Work closely with hardware and software teams to influence next-generation networking hardware specifications.
  • Develop analytical models and simulators to predict system performance under emerging workloads.

Qualifications:

  • PhD or MS in Computer Science, Electrical Engineering, or a related field, or equivalent experience.
  • 12 or more years of industry experience in high-performance computing or distributed deep learning.
  • Deep understanding of parallelism techniques including data, tensor, and pipeline parallelism;
  • Expertise in advanced parallelism strategies such as context parallelism, expert parallelism, and ZeRO optimizations.
  • Strong technical proficiency with NCCL, UCX, UCC, NVSHMEM, or MPI.
  • Experience with RDMA, RoCE, and InfiniBand technologies;
  • Advanced knowledge of inference engines such as TensorRT-LLM, vLLM, or similar systems.
  • Strong understanding of GPU architecture, including memory hierarchy and CUDA programming models.

Preferred:

  • Experience developing frameworks such as Megatron-Core, DeepSpeed, or JAX/XLA.
  • Contributions to open-source projects such as PyTorch Distributed, KServe, or Ray;
  • Proven experience deploying and optimizing large-scale models on GPU-based systems.
  • Publications or patents in top-tier systems or architecture conferences.

Benefits and Compensation:

  • Base salary range: $272,000 to $431,250 per year, depending on location and experience.
  • Eligibility for equity and comprehensive benefits package;
  • Opportunity to work with cutting-edge AI and HPC technologies in a high-impact environment.

JOB TYPE

Full-time

COMPENSATION

$272k - $431k

SHARE

Important: To avoid application spam, include this statement at the end of your resume or application: 'I found this position on ( Quantum Jobs USA ) .' Applications without it will be disqualified.

Back to all Quantum jobs
arrow mark
📌 Be the FIRST to join Qizz — The Talent Intelligence Platform for quantum.  Join Waitlist

Suggested Read:

Quantum Career Roadmap

This is your step-by-step guide to building a career in quantum computing.

Read

How to Attract Quantum Talent

Recruiting quantum talent is not like traditional tech hiring. You have to go where they are.

Read

Do I need PhD for Quantum Job

IBM says over 60% of quantum jobs don’t require a PhD, showing diversity in the field.

Read

Quantum Jobs Salary

This guide explains how much you can earn in quantum jobs in the U.S.

Read

Quantum Job Requirement

This guide provides necessary educational pathways, certifications, skills info.

Read

Quantum Jobs in USA

Learn about the the quantum computing job market in the USA.

Read

Few related jobs: