PhD Multimodal AI Intern

Dolby Sound Laboratories San Francisco, CA 94118

Posted 2 weeks ago

Join the leader in entertainment innovation and help us design the future. The Dolby U internship program offers impactful, project-based work experience in a collaborative, creative environment where you work side by side with industry leaders. Amplify your insatiable curiosity by implementing real-world solutions that revolutionize how people communicate and how entertainment is created, delivered, and enjoyed worldwide. We offer a collegial culture, challenging projects, and excellent compensation and benefits, not to mention a Flex Work approach that is truly flexible to support where, when, and how you do your best work. For any student seeking to gain invaluable expertise through meaningful, personal contributions, we invite you to join us in continuing to design a future where technology meets entertainment!

The Advanced Technology Group (ATG) is the research division of the company. ATG's mission is to look ahead, deliver insights, and innovate technological solutions that will fuel Dolby's continued growth. Our researchers have a broad range of expertise related to computer science and electrical engineering, such as AI/ML, algorithms, digital signal processing, audio engineering, image processing, computer vision, data science & analytics, distributed systems, cloud, edge & mobile computing, computer networking, and IoT.

Responsibilities

As a member of the Multimodal Processing Team, your role will involve creating novel AI algorithms that utilize audio, video, text, or other input modalities. These algorithms aim to enhance audiovisual experiences and intelligently analyze or process content, with the ultimate mission to build innovative technologies that can revolutionize entertainment.

At Dolby, everyone is invested in your success and strives to make it the best place for you to start your career. As part of your internship experience at Dolby, you'll get the following:

  • First-hand exposure to Dolby technology.

  • A diverse, open, and welcoming culture.

  • Practical experience: be a part of real-world projects.

  • Impact: your work will be used by millions of people every day.

  • The potential to publish and/or patent your innovations.

What are we looking for in candidates?

Along with solid technical skills, candidates should demonstrate problem-solving and analytical abilities, good communication and collaboration skills, a curiosity for how and why things work as they do, and a passion for audio, video, movies, music, or game technology.

Areas of Focus

  • Multimodal machine learning and deep learning.

  • Adversarial machine learning.

  • Multimodal LLMs.

  • Audiovisual content analysis and enhancement.

  • Multimodal representation learning.

  • Generative AI for audio and video.

Qualifications

  • Working towards a Master's or Ph.D. degree in Artificial Intelligence, Electrical Engineering, Computer Science, or a related field.

  • Experience developing and training deep learning architectures.

  • Experience working with deep learning architectures for audio and/or video applications.

  • Experience tackling and understanding representation learning problems.

  • Experience working on adversarial machine learning problems is a plus.

  • First-author publications at peer-reviewed AI conferences (CVPR, ICCV, ECCV, NeurIPS, ICML, InterSpeech, ICASSP, etc.).

  • Programming experience in Python, and experience working with frameworks like PyTorch or TensorFlow.

  • Ability to prototype quickly, with adept critical thinking skills.

  • Excellent communication skills and a team-oriented work ethic.

Eligibility

Working towards a Ph.D. degree in Computer Science, Electrical Engineering, or a related field; recent grads within six months of graduation are also eligible to apply. Must be available to work full-time, Monday to Friday, for 3 months between September 2024 and December 2024.

The start date for the internship is as follows (note that this date is not flexible):

  • Monday, September 23, 2024

The San Francisco/Bay Area base hourly range for this internship position is $44-57/hr and can vary if outside of this location. Our hourly ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific hourly range and perks and benefits for your location during the hiring process.

Dolby will consider qualified applicants with criminal histories in a manner consistent with the requirements of San Francisco Police Code, Article 49, and Administrative Code, Article 12.

Equal Employment Opportunity:

Dolby is proud to be an equal opportunity employer. Our success depends on the combined skills and talents of all our employees. We are committed to making employment decisions without regard to race, religious creed, color, age, sex, sexual orientation, gender identity, national origin, religion, marital status, family status, medical condition, disability, military service, pregnancy, childbirth and related medical conditions or any other classification protected by federal, state, and local laws and ordinances.

