AI Performance Software Engineer Job at Signify Technology, San Francisco, CA

ckpHRjZZSVFON2c0NHgrUFNaZVl1LzRGMWc9PQ==
  • Signify Technology
  • San Francisco, CA

Job Description

AI Performance Engineer – CUDA & PyTorch Focus

Location: San Fransisco, CA

Compensation: $200,000-$300,000

A stealth-mode AI systems company is reimagining how large-scale inference is done. With generative AI workloads scaling rapidly, inference efficiency has become a critical bottleneck. We're building an integrated hardware-software platform that brings breakthrough performance and usability to production-scale LLM applications.

This is an opportunity to work on a highly technical team spun out of top-tier academic research, focused on the cutting edge of AI, distributed systems, and performance optimization.

What You’ll Do:

  • Drive core research and implementation of performance optimizations for modern AI models
  • Implement advanced techniques like FlashAttention, KV caching, quantization, and model compression
  • Design and build scalable, distributed compute strategies across GPU-based systems
  • Profile, benchmark, and optimize CUDA kernels and AI runtime performance across inference stacks
  • Work across frameworks like PyTorch, ONNX, and vLLM to improve end-to-end efficiency

What We're Looking For:

  • Strong background in CUDA and low-level GPU performance tuning
  • Proven experience building with PyTorch and deploying high-performance ML models
  • Proficiency in Python and C++
  • Experience with large-scale distributed systems in cloud environments (AWS, GCP, or Azure)
  • Exposure to AI compilers or frameworks like MLIR is a plus
  • Interest in system design, scalability, and accelerating LLM workloads in real production environments

If you’ve spent your time making large models faster, leaner, and more efficient—and want to solve hard technical problems at the core of GenAI infrastructure—this role is for you.

Reach out to learn more.

Job Tags

Similar Jobs

Amentum

Flight Instructor - UH-72 Job at Amentum

 ...of **Helicopter Pilot** **(Instructor)** at **Fort Novosel, Alabama.**Applicants selected for hire will provide instruction in the UH-72. Military aviation experience or UH-72 qualification is not required.**What We Do**Since 1989, Amentum Services, Inc. with its 4... 

Jobot

Immigration Paralegal Job at Jobot

 ...Immigration Paralegal w/ EB 5 This Jobot Job is hosted by: Christopher Mildyn Are you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume. Salary: $65,000 - $80,000 per year A bit about us: dedicated and full-service Immigration... 

Real Time Consulting LLC (RTC)

Human Factors Engineer Job at Real Time Consulting LLC (RTC)

 ...REAL TIME CONSULTING (An RTCo Company)POSITION DESCRIPTION JOB TITLE: Human Factors Engineer About The Job We are seeking human factors engineers to provide broad experience in product development, services development, in-service activities, and certification... 

H2Health

Bilingual Pediatric Speech-Language Pathologist Assistant Job at H2Health

 ...Position Title: Bilingual Pediatric Speech-Language Pathologist Assistant Location: Longview, Texas, United States Department: Longview Work Type: Full-time Workplace Type: On-site Description Bilingual Speech-Language Pathologist... 

NHC Place Sumner

Nurse Assessment Coordinator / MDS - RN Job at NHC Place Sumner

 ...compensation wage increases based on performance. If you want this experience in your career, apply today! Position: RN, Nurse Assessment Coordinator / MDS / RN Supervisor Pay: $40-$45 an hour Qualifications: Registered Nurse - graduate of an accredited...