Posts tagged "research"

3 posts found

Intro to Triton with Matrix Multiplication

March 18, 2025 by Xiaoyou Wu

Introduction to GPU programming with Triton and build the matrix multiplication along the way

January 15, 2025 by Xiaoyou Wu

Implementation details and best practices for Quantization Aware Training (QAT) with LoRA, including GPU memory optimization strategies

January 10, 2025 by Xiaoyou Wu

Comprehensive report on quantization strategies including switchable precision and cyclic precision training applied to WikiText-103 dataset