LLM Architecture
Attention, KV cache, normalization, positional encoding, and memory mechanisms.
I write technical notes that turn model architecture and infrastructure details into clear engineering references. Current focus: attention mechanisms, LLM memory, PyTorch, Linux, and networking fundamentals.
About
This site is my public workspace for studying deep learning systems and writing down the parts that are easy to forget: tensor operations, Transformer components, decoding efficiency, Linux usage, and practical network paths.
I care about explanations that connect math, code, and systems behavior, so each note is written as a reusable reference for future projects and interviews.
Focus Areas
Attention, KV cache, normalization, positional encoding, and memory mechanisms.
PyTorch workflows, Hugging Face tooling, tensor operations, and reproducible experiments.
Linux, shell tooling, Docker basics, Git.
Skills
Projects
Writing
Loading notes...