Software Engineer, AI Inference Codesign
Company: Tesla, Inc.
Location: Palo Alto
Posted on: April 16, 2025
Job Description:
Job Category: AI & Robotics
Req. ID: 241261
Job Type: Full-time
What to Expect:
The AI inference codesign team's goal is to take research
models and make them run efficiently on our AI-ASIC to power
real-time inference for Autopilot and Optimus programs. This unique
role lies at the intersection of AI research, compiler development,
kernel optimization, math and HW design. You will work extensively
with AI engineers and come up with novel techniques to quantize
models, improve precision and explore non-standard alternate
architectures. You will develop optimized micro kernels using
a cutting-edge MLIR compiler and resolve the performance bottlenecks
that stand in the way of the real-time latency required for self-driving and
humanoid robots. You will work closely with the HW team and bring
state-of-the-art HW architecture techniques to our next generation
HW SoCs.
What You'll Do:
- Research and implement state-of-the-art machine learning
techniques to achieve high performance on our edge hardware.
- Optimize bottlenecks in the inference flow, make
precision/performance tradeoff decisions and figure out novel
techniques to improve hardware utilization and throughput.
- Implement/improve highly performant micro kernels for Tesla's
AI ASIC.
- Work with AI teams to design edge friendly neural network
architectures.
- Collect extensive performance benchmarks (latency, throughput,
power) and work with HW teams to shape the next generation of
inference hardware, balancing performance with versatility.
- Experiment with numerical methods and alternative
architectures.
- Collaborate with the compiler infrastructure for
programmability and performance.
What You'll Bring:
- Degree in Engineering, Computer Science or equivalent in
experience and evidence of exceptional ability.
- Proficiency with Python and C++, including modern C++
(14/17/20).
- Experience with AI networks, such as CNNs, transformers, and
diffusion model architectures, and their performance
characteristics.
- Understanding of GPU, SIMD, multithreading and/or other
accelerators with vectorized instructions.
- Exposure to computer architecture and chip
architecture/micro-architecture.
- Specialized experience in one or more of the following machine
learning/deep learning domains: Model compression, hardware aware
model optimizations, hardware accelerators architecture, GPU/ASIC
architecture, machine learning compilers, high performance
computing, performance optimizations, numerics and SW/HW
co-design.
Compensation and Benefits:
Along with competitive pay, as
a full-time Tesla employee, you are eligible for the following
benefits at day 1 of hire:
- Aetna PPO and HSA plans: 2 medical plan options with $0
payroll deduction.
- Family-building, fertility, adoption and surrogacy
benefits.
- Dental (including orthodontic coverage) and vision plans, both
of which have options with a $0 paycheck contribution.
- Company-paid Health Savings Account (HSA) contribution when
enrolled in the High Deductible Aetna medical plan with HSA.
- Healthcare and Dependent Care Flexible Spending Accounts
(FSA).
- 401(k) with employer match, Employee Stock Purchase Plans, and
other financial benefits.
- Company-paid Basic Life, AD&D, short-term and long-term
disability insurance.
- Employee Assistance Program.
- Sick and Vacation time (Flex time for salary positions), and
Paid Holidays.
- Back-up childcare and parenting support resources.
- Voluntary benefits to include: critical illness, hospital
indemnity, accident insurance, theft & legal services, and pet
insurance.
- Weight Loss and Tobacco Cessation Programs.
- Tesla Babies program.
- Commuter benefits.
- Employee discounts and perks program.
Expected Compensation:
$120,000 - $318,000/annual salary + cash and stock awards + benefits.
Pay offered may vary depending on multiple
individualized factors, including market location, job-related
knowledge, skills, and experience. The total compensation package
for this position may also include other elements dependent on the
position offered. Details of participation in these benefit plans
will be provided if an employee receives an offer of
employment.
Tesla is an Equal Opportunity employer. All qualified
applicants will receive consideration for employment without regard
to any factor, including veteran status and disability status,
protected by applicable federal, state or local laws. Tesla is also
committed to working with and providing reasonable accommodations
to individuals with disabilities. Please let your recruiter know if
you need an accommodation at any point during the interview
process.
Privacy is a top priority for Tesla. We build it into our
products and view it as an essential part of our business. To
understand more about the data we collect and process as part of
your application, please view our Tesla Talent Privacy Notice.