AI Performance Software Engineer Job at Signify Technology, San Francisco, CA

SnZnSTJ2aFZETTd2SDNXSHpyeE9ZS3Q2S3c9PQ==
  • Signify Technology
  • San Francisco, CA

Job Description

AI Performance Engineer – CUDA & PyTorch Focus

Location: San Fransisco, CA

Compensation: $200,000-$300,000

A stealth-mode AI systems company is reimagining how large-scale inference is done. With generative AI workloads scaling rapidly, inference efficiency has become a critical bottleneck. We're building an integrated hardware-software platform that brings breakthrough performance and usability to production-scale LLM applications.

This is an opportunity to work on a highly technical team spun out of top-tier academic research, focused on the cutting edge of AI, distributed systems, and performance optimization.

What You’ll Do:

  • Drive core research and implementation of performance optimizations for modern AI models
  • Implement advanced techniques like FlashAttention, KV caching, quantization, and model compression
  • Design and build scalable, distributed compute strategies across GPU-based systems
  • Profile, benchmark, and optimize CUDA kernels and AI runtime performance across inference stacks
  • Work across frameworks like PyTorch, ONNX, and vLLM to improve end-to-end efficiency

What We're Looking For:

  • Strong background in CUDA and low-level GPU performance tuning
  • Proven experience building with PyTorch and deploying high-performance ML models
  • Proficiency in Python and C++
  • Experience with large-scale distributed systems in cloud environments (AWS, GCP, or Azure)
  • Exposure to AI compilers or frameworks like MLIR is a plus
  • Interest in system design, scalability, and accelerating LLM workloads in real production environments

If you’ve spent your time making large models faster, leaner, and more efficient—and want to solve hard technical problems at the core of GenAI infrastructure—this role is for you.

Reach out to learn more.

Job Tags

Similar Jobs

Intellias

Senior Information Security Specialist Job at Intellias

 ...The Information Security Specialist III is a senior technical expert responsible for leading the design, implementation, and oversight of the organization's cybersecurity strategy. This role requires deep knowledge of information security concepts, advanced threat mitigation... 

WorldLink US

UX / UI Designer Job at WorldLink US

 ...TITLE: UX / UI Designer IV POSITION TYPE: Full Time (W2) LOCATION: New York, NY or Mountain View, CA ABOUT WorldLink: WorldLink is a rapidly growing information technology company at the forefront of the tech transformation. From custom software development... 

Prana Talent

RN CVICU | CARDIOVASCULAR INTENSIVE CARE UNIT NURSE Job at Prana Talent

 ...make a transformative impact in the world of healthcare as a/an RN CVICU | CARDIOVASCULAR INTENSIVE CARE UNIT NURSE LAKE...  ...Nurse, You are the gold standard in nursing, blending advanced clinical expertise with unshakable compassion. From bedside care to boardroom... 

Holy Cross Health Fl

Clinical Research Associate - Registered Nurse Job at Holy Cross Health Fl

 ...responsible for the conduct of multiple Oncology clinical research activities. Ideal candidate will...  ..., FDA, ICH, GCP, or other sources *FL RN License *On-Site *Oncology Experience...  ...Lauderdale, Florida is a full-service, non-profit Catholic hospital, sponsored by the... 

HireTalent - Staffing & Recruiting Firm

Computer Numerical Control Machinist Job at HireTalent - Staffing & Recruiting Firm

 .... ~ Experience with aerospace-grade materials. ~ Skilled with Fanuc, Siemens, Heidenhain, or Mazatrol controls. ~ Strong blueprint reading and GD&T knowledge. Preferred: Mastercam or NX programming experience VTL, CMM, EDM, or grinding experience...