AI Performance Software Engineer Job at Signify Technology, San Francisco, CA

ZDQycDVGak5OZHlSODYvY3dWZHlQdmNLZmc9PQ==
  • Signify Technology
  • San Francisco, CA

Job Description

AI Performance Engineer – CUDA & PyTorch Focus

Location: San Fransisco, CA

Compensation: $200,000-$300,000

A stealth-mode AI systems company is reimagining how large-scale inference is done. With generative AI workloads scaling rapidly, inference efficiency has become a critical bottleneck. We're building an integrated hardware-software platform that brings breakthrough performance and usability to production-scale LLM applications.

This is an opportunity to work on a highly technical team spun out of top-tier academic research, focused on the cutting edge of AI, distributed systems, and performance optimization.

What You’ll Do:

  • Drive core research and implementation of performance optimizations for modern AI models
  • Implement advanced techniques like FlashAttention, KV caching, quantization, and model compression
  • Design and build scalable, distributed compute strategies across GPU-based systems
  • Profile, benchmark, and optimize CUDA kernels and AI runtime performance across inference stacks
  • Work across frameworks like PyTorch, ONNX, and vLLM to improve end-to-end efficiency

What We're Looking For:

  • Strong background in CUDA and low-level GPU performance tuning
  • Proven experience building with PyTorch and deploying high-performance ML models
  • Proficiency in Python and C++
  • Experience with large-scale distributed systems in cloud environments (AWS, GCP, or Azure)
  • Exposure to AI compilers or frameworks like MLIR is a plus
  • Interest in system design, scalability, and accelerating LLM workloads in real production environments

If you’ve spent your time making large models faster, leaner, and more efficient—and want to solve hard technical problems at the core of GenAI infrastructure—this role is for you.

Reach out to learn more.

Job Tags

Similar Jobs

GQR Healthcare

Travel Nurse RN - CVICU Job at GQR Healthcare

 ...Job Description GQR Healthcare is seeking a travel nurse RN CVICU for a travel nursing job in Woodland Hills, California. Job Description...  ...-Interventional Radiology RN- OK if you do not have experience. They WILL train! Ideally looking for a PACU, ICU, Cath Lab or... 

iHerb

Project Coordinator- Localization Job at iHerb

 ...Job Summary: The Project Coordinator will support the day-to-day operations of the Global Localization Team, with a focus on cross-functional coordination, vendor communication...  ...internal stakeholders and external LSPs. Manage daily operations such as email/chat... 

Metro Express Logistics Inc

Class A Truck Drivers Wanted Job at Metro Express Logistics Inc

Class A Truck Drivers WantedWe are looking for OTR drivers. All of our trips are cross country. Our drivers due over 3,000 miles a week. Trip...  ...THEIR OWN SCHEDULE - MINIMAL TRIP IS 7-10 DAYS. HOME TIME IS UP TO YOU. (1-15 days)- MAXIMUM TRIP IS UNLIMITED.How to apply... 

American Income Life

Work From Home Benefit Specialist Job at American Income Life

 ...currently only hiring U.S. residents who are legally authorized to work in the United States with a social security # (US Only). We are...  ...achieving remarkable success? Join our fully virtual and work-from-home team, where you can earn an extraordinary income without... 

Aldi

2nd Shift Full-Time Warehouse Associate Job at Aldi

 ...collaborative working environment with peers and supervisors. Demonstrates a Positive Attitude and Resilience: Adapts positively to pressure, setbacks, challenges and change in order to achieve and sustain peak effectiveness. Drives for Success: Delivers excellent...