Machine Learning Infrastructure Engineer

XPeng Motors

  • Full Time

XPeng Motors is one of China’s leading smart electric vehicle (“EV”) companies. We design, develop, manufactures and market smart EVs that are seamlessly integrated with advanced Internet, AI and autonomous driving technologies. We are committed to in-house R&D and intelligent manufacturing to create a better mobility experience for our customers. We strive to transform smart electric vehicles with technology and data, shaping the mobility experience of the future. 

We are looking for machine learning infrastucture engineer with strong programming and development skills and experience with machine learning. Experiences with machine learning algorithm, applications and frameworks are required. 

Our mission is to solve the autonomous driving problem. You will work with a team of machine learning researchers to build AI software systems, learn about deep learning algorithms, and use your technical skills to advance autonomous driving.

(C++ focus)

Job Responsibilities:

  • Profile large-scale training jobs and identify/resolve potential bottlenecks to better utilize hardware accelerators.
  • Implement efficiency CUDA kernels to speed up training forward and backward speed.
  • Implement efficiency gradient sync method to speed up training speed.

Minimum Skill Requirements:

  • Programming fluently in C/C++ with a deep understanding of software design, programming techniques, and algorithms.
  • Experience with parallel programming, ideally CUDA C/C++ and OpenACC.
  • Master one of several mainstream machine learning development framework such as TensorFlow, PyTorch, MXNet.
  • Strong mathematical fundamentals, including linear algebra and numerical methods, is plus

(Python focus)

 Job Responsibilities:

  • Improve training SDK architecture including data-io, augmentation, visualization, evaluation and testing pipeline.
  • Implement functions and APIs to support big data tooling, resource monitor and other on-cluster functionalities.
  • Work cross team to deploy machine learning solutions to embedded systems and/or cloud cluster.

Minimum Skill Requirements:

  • Programming fluently in Python with a deep understanding of software design, programming techniques, and algorithms.
  • Master one of several mainstream machine learning development framework such as TensorFlow, PyTorch, MXNet.
  • Familiar with machine learning ecosystems and latest trends.

What do we provide:

  • A fun, supportive and engaging environment
  • Opportunity to make significant impact on transportation revolution by the means of advancing autonomous driving
  • Opportunity to work on cutting edge technologies with the top talent in the field
  • Competitive compensation package
  • Snacks, lunches, and fun activities

 

We are an Equal Opportunity Employer. It is our policy to provide equal employment opportunities to all qualified persons without regard to race, age, color, sex, sexual orientation, religion, national origin, disability, veteran status or marital status or any other prescribed category set forth in federal or state regulations.