Nurgaliyev Shakhizat
Published © GPL3+

Inferencing with vLLM and Triton on NVIDIA Jetson AGX Orin

This tutorial shows how to run Large language models using the NVIDIA Triton and vLLM on the NVIDIA Jetson AGX Orin 64GB Developer Kit.

AdvancedFull instructions provided3 hours119
Inferencing with vLLM and Triton on NVIDIA Jetson AGX Orin

Things used in this project

Story

Read more

Credits

Nurgaliyev Shakhizat

Nurgaliyev Shakhizat

69 projects • 165 followers
I am a hardcore robotics and IoT enthusiast. Email: shahizat005@gmail.com

Comments