Intro to Triton with Matrix Multiplication
Introduction to GPU programming with Triton and build the matrix multiplication along the way
Read More →
Computer Engineering @GaTech | Research in Efficient LLM & robot localization
A student of electrodynamics and existentialism who finds rhythm in both classical mechanics and Ne-Yo(or Usher). Whether I'm designing efficient infrastructure, hitting a badminton court, or poping, I’m driven by the pursuit of truth and a perfectly timed beat.
"I have never believed more firmly that my sole purpose in this world is to contemplate it, and to exhaust all my fire in the pursuit of truth."
The relationship between transfer function and electric field?
A transfer function $H(s)$ is more than a ratio; it is the signature of a medium's soul. In electromagnetics, space acts as a processor—a spatial filter where $E_{out}(s) = H(s) \cdot E_{in}(s)$. Permittivity $\epsilon(\omega)$ is the material's personality, governing how it breathes in response to the pressure of an external field.
How does Diffusion Blocks changes the inference and what factor determine the most efficient block size?
Diffusion Blocks are architectural heartbeats. Larger blocks offer contextual depth for global alignment but demand heavy GPU tolls. Efficiency is found at the Information Bottleneck: matching the block's receptive field to the Latent Resolution to ensure every FLOP sculpts meaning rather than redundant noise.
Why Circuits have relatively larger LTI range than other field (language, material science, etc.)?
Circuits enjoy mathematical grace because we manufacture them to obey our symbols. Through negative feedback, we create 'linear islands' in a non-linear sea. Unlike language, where context is volatile, circuits thrive on Superposition—where the response is blissfully indifferent to surrounding signals.
How to develop sharp intuition on the electrical components work?
To 'see' electricity, look for where the energy is hiding. See Voltage as the desperate urge of electrons to find peace and Current as their kinetic rush. Intuition is forged in the sandbox of failure, understanding that electricity is a fluid, living force seeking equilibrium.
Introduction to GPU programming with Triton and build the matrix multiplication along the way
Read More →Implementation details and best practices for Quantization Aware Training (QAT) with LoRA, including GPU memory optimization strategies
Read More →Comprehensive report on quantization strategies including switchable precision and cyclic precision training applied to WikiText-103 dataset
Read More →