PyTorch ML Software engineer

Phenikaa-X is a subsidiary of the Phenikaa Group, driven by the ambition to become a leading technology company in the field of autonomous vehicles and industrial robotics in Vietnam. We have successfully developed level 4/5 autonomous vehicles and are currently focusing on researching and developing drone monitoring technologies, receptionist robots, and autonomous driving systems.

We are currently seeking a talented and motivated autopilot engineer to join our team. This role offers an exciting opportunity to work in cutting-edge technologies, with a primary focus on developing and implementing solutions to tackle challenges related to receptionist robot functionalities. As an intern at Phenikaa-X, you will have the opportunity to work alongside experienced professionals and contribute to the development of innovative solutions in the robotics industry.

What to Expect

As an AI Inference Software Engineer within the Autopilot team, you will have the opportunity to fine-tune, deploy, and optimize large neural networks for efficient inference on heterogeneous edge devices (CPU/GPU/AI ASIC). The nature of the role is multi-disciplinary – you will be working at the intersection of machine learning and systems. You will build the frameworks and infrastructure that enable the seamless deployment, integration, and inference of all neural networks that run on Autopilot. You will develop system tools to benchmark, characterize, and optimize the latency and throughput of AI workloads on the FSD chip.

What You’ll Do

Build robust AI frameworks to lower neural networks (PyTorch) to edge devices
Build robust AI infrastructure to train and fine-tune networks for Autopilot on large GPU clusters
Deploy state-of-the-art neural networks on heterogenous compute, to maximize network performance while minimizing latency
Closely collaborate with AI scientists and hardware teams to effectively quantize, prune, and run inference in low-precision
Design and implement custom GPU kernels (OpenCL/CUDA) for efficient training and post-processing of network output

What You’ll Bring

Proficiency with Python and C++, including modern C++ (14/17/20)
Experience with PyTorch, TensorFlow, or other machine learning frameworks
Experience with Machine Learning, Deep Learning, and Computer Vision
Experience with Model Fine-Tuning: Quantization Aware Training, Compression, Pruning
Experience with training and deploying neural networks for real-world AI
Experience with Computer Systems/Architecture
Experience with CUDA and/or OpenCL

Benefit

Salary up to 2200$, negotiation based on candidate’s capacity.
Being pioneering young individuals participating in building and developing new products, new business directions.
Collaborating with outstanding teammates.
Insurance plan based on full salary + 13th month salary + Performance bonus.
Working time: 08:30 AM – 5:30 PM from Mondays to Fridays (meal breaks included).

Other Benefit

Equipped with modern equipment such as a high configuration PC.
Accessed high-configured supercomputers for training AI models.
100% support for company trips: Summer Holiday…

Apply for this position