Meta

AI/HPC Systems Engineer

Oslo
Norway
Full-time
On-site
All
Corporate

Job Description

The AI/HPC Systems Engineering team is looking for a systems engineer with knowledge of AI or HPC systems to help shape the next generation of those systems. Meta’s cutting-edge AI/HPC systems are helping drive the next generation of innovation, and in this role you will have a unique opportunity to shape the direction of Meta by specifying technical requirements and steering the industry and ecosystem.

This role works across many projects in our team to shape our system requirements and design. A successful candidate will be a hardware system and platform builder with a breadth of knowledge. Their background could span boards, sensors, FPGA RTL design/verification, performance/power testing, OS kernel/driver software, and architectures.

The position requires a developer who can debug and extract solutions from vague descriptions of system architecture and workloads. A successful candidate is also expected to build cross-functional relationships across teams to find ideas, assets, and assistance to explore faster.

AI/HPC Systems Engineer Responsibilities

  • Shape the next generation of AI/HPC architectures to maximize system performance & reliability.
  • Prototype new AI/HPC workload processing ideas & understand their impact on system design.
  • Develop code and/or infrastructure for simulation platforms intended to derive performance requirements both for our hardware systems and ASICs.
  • Develop code and/or infrastructure for the performance validation of our hardware systems & ASICs.
  • Filter, analyze, and interpret data from our deep learning workloads as well as test systems to drive architectural insights & value proposition.
  • Develop tooling to root cause observed performance bottlenecks.

Minimum Qualifications

  • Master’s/PhD degree in Computer Science, Information Engineering, or a similar field, or a BS and 3+ years of industry experience.
  • Experience identifying problem statements in large-scale, complex systems and devising solutions.
  • Knowledge of distributed deep learning systems and parallel machine learning workloads.
  • Familiarity with programming domain-specific accelerators (e.g., DPUs, AI accelerators).
  • Experience managing work under ambiguity in a fast-changing field.
  • Experience working effectively as an individual and in a multidisciplinary team.
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment.

Preferred Qualifications

  • Familiarity with GPU programming (such as CUDA) and/or performance validation.
  • Familiarity with high-bandwidth proprietary interconnects for accelerators (such as NVLink, XGMI).
  • Familiarity with architectural trade-offs in board design, high-speed signal integrity, bus architecture, rack design, and network topology.
  • Experience benchmarking distributed deep learning systems and parallel machine learning workloads.
  • Experience with power testing and evaluation on prototyping platforms.
  • Experience debugging lab systems with logic analyzers, scopes, meters, etc.
  • Basic knowledge of chip architecture, µarchitecture, and design.
  • Strength in mathematics, numerical modeling, stochastic processes, and optimization techniques.
  • Familiarity with FPGA hardware tuning (SerDes, voltage, etc.).
  • Proficiency in C, C++ and/or Python.

About Meta

Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics.

Meta is committed to providing reasonable support (called accommodations) in our recruiting processes for candidates with disabilities, long-term conditions, mental health conditions, or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support. If you need support, please reach out to accommodations-ext@fb.com.

