Nurgaliyev Shakhizat
Published © GPL3+

Inferencing with vLLM and Triton on NVIDIA Jetson AGX Orin

This tutorial shows how to run Large language models using the NVIDIA Triton and vLLM on the NVIDIA Jetson AGX Orin 64GB Developer Kit.

AdvancedFull instructions provided3 hours2,032
Inferencing with vLLM and Triton on NVIDIA Jetson AGX Orin

Things used in this project

Story

Read more

Credits

Nurgaliyev Shakhizat
74 projects • 190 followers
I am a hardcore robotics and IoT enthusiast. Email: shahizat005@gmail.com
Contact

Comments

Please log in or sign up to comment.