AI Inference Engineer Job at Signify Technology, San Francisco, CA

  • Signify Technology
  • San Francisco, CA

Job Description

AI Inference Engineer – Stealth Startup | San Francisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.
