Nurgaliyev Shakhizat
Published © GPL3+

Inferencing with vLLM and Triton on NVIDIA Jetson AGX Orin

This tutorial shows how to run Large language models using the NVIDIA Triton and vLLM on the NVIDIA Jetson AGX Orin 64GB Developer Kit.

AdvancedFull instructions provided3 hours282
Inferencing with vLLM and Triton on NVIDIA Jetson AGX Orin

Things used in this project

Story

Read more

Credits

Nurgaliyev Shakhizat

Nurgaliyev Shakhizat

70 projects • 168 followers
I am a hardcore robotics and IoT enthusiast. Email: shahizat005@gmail.com

Comments