Sr Principal Software Engineer, Quantization (Ai2324)

Sima Technologies, Inc. San Jose , CA 95111

Posted 2 weeks ago

Job Title: Sr Principal Software Engineer, Quantization

Job Location: San Jose, CA

Job Number: AI2324

Job Description:

SiMa.ai is seeking an outstanding researcher working on efficient deep learning to join the MLSoC Platform Architecture team. We are passionate about pushing the boundaries of Edge AI with power efficient inferencing. We are particularly interested in Post Training Quantization and Pruning techniques applied to quantization of CNN and Transformer based Neural Networks for inference primarily on int8 Machine Learning Accelerator (MLA) and on mixed precision MLA. You will work with an amazing team of engineers that pushes the boundaries and your contributions will have a chance to create a real impact in our products.

Sr. Principal Engineer Key Responsibilities (including but not limited to):

  • Research, design and implement novel methods to improve PTQ techniques for both int8 and mixed-precision (int8 + bf16) quantization.

  • Collaborate with other team members to understand the limitations of our Machine Learning Accelerator and adapt your strategy based on their input.

  • Prototype PTQ techniques using Fake Quantization in PyTorch, as well as modify internal tools to implement quantized operators to verify accuracy.

  • Understand state-of-the-art research in PTQ and apply it to CNN and Transformer based Neural Networks.

  • Help define timeline and deliverables and be accountable for them.

Required Background:

  • PhD in electrical engineering or computer science with 6+ years research numerical methods and tools in efficient Neural Network inferencing.

  • Proficient in techniques like HAWQ2, and RL based methods for Mixed-precision quantization.

  • Proficient in state-of-the-art PTQ techniques like Optimum Brain Compression for LLMs.

  • Proficient with PyTorch or other Quantization exploration frameworks like Model Compression Toolkit.

  • Excellent programming skills in C++, Python.

Highly desirable:

  • Co-authored internal technical presentations, research papers and disclosures/patents on key technical topics

  • Noteworthy technical contributions, which were multi-disciplinary and in collaboration with other cross-functional teams.

Personal Attributes:

You have a can-do attitude, are execution and results focused, highly accountable, a strong team player, with high integrity, and have a track record of innovation. Capable of handling difficult conversations in a professional manner. You are self motivated and have a great desire to have a significant impact on products that provide value to customers.

The annual salary for this position ranges from $240,000 to $305,000. The actual annual salary paid for this position will be based on several factors, including but not limited to, skills, prior experiences, qualifications, expertise, work location, total target compensation, training, company needs, and current market demands. The annual salary range for this position is subject to change and may be adjusted in the future.

EEO Employer: SiMa is an equal opportunity employer; all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status or any other protected classification.


icon no score

See how you match
to the job

Find your dream job anywhere
with the LiveCareer app.
Mobile App Icon
Download the
LiveCareer app and find
your dream job anywhere
App Store Icon Google Play Icon
lc_ad

Boost your job search productivity with our
free Chrome Extension!

lc_apply_tool GET EXTENSION

Similar Jobs

Want to see jobs matched to your resume? Upload One Now! Remove

Sr Principal Software Engineer, Quantization (Ai2324)

Sima Technologies, Inc.