Job Description
Responsibilities:
- Develop AI/LLM kernels/operators on Quadric platform for efficient inference
- Optimize the kernel performance for different hardware configurations and workloads
- Profile and analyze kernel performance in terms of compute, data and parallelism; identify micro-architecture and software bottlenecks and provide optimization solutions
- Optimize kernel C/C++ codes, maximize hardware utilization
- Collaborate across related areas of the AI inference stack to support team and business priorities
- Make Improvement to Quadric toolchain, compiler and runtime
- Provide technical support and documents to customers and developer community
Ideal Candidate
- Bachelor’s or Master’s in Computer Science and
- 5+ years of experience in AI kernel development and optimization
- experience with model and kernel inference performance profiling
- experience with at least one of the following compute development: CUDA, DSP, NEON, Triton-lang
- Proficiency in C/C++ and Python, experience with assembly language a plus
- Demonstrate good capability in problem solving, debug and communication
Are you interested in this position?
Apply by clicking on the “Apply Now” button below!
#GraphicDesignJobsOnline
#WebDesignRemoteJobs #FreelanceGraphicDesigner #WorkFromHomeDesignJobs #OnlineWebDesignWork #RemoteDesignOpportunities #HireGraphicDesigners #DigitalDesignCareers# Dynamicbrand guru
Apply Now