<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Transformers on Hitesh Pattanayak</title><link>/tags/transformers/</link><description>Recent content in Transformers on Hitesh Pattanayak</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Sun, 22 Mar 2026 11:43:37 -0700</lastBuildDate><atom:link href="/tags/transformers/index.xml" rel="self" type="application/rss+xml"/><item><title>The Evolution: Beyond Transformers</title><link>/posts/the-evolution-beyond-transformers/</link><pubDate>Sun, 22 Mar 2026 11:43:37 -0700</pubDate><guid>/posts/the-evolution-beyond-transformers/</guid><description>A practical walkthrough of how the Transformer architecture evolved from encoder-decoder to decoder-only models, why attention&amp;rsquo;s quadratic scaling became a hard wall, and how Mamba&amp;rsquo;s state space machines are being absorbed into hybrid architectures that dominate production today.</description></item><item><title>Training for Greatness: Speed, BLEU Records, and the Multimodal Vision</title><link>/posts/training-for-greatness-speed-bleu-records-and-the-multimodal-vision/</link><pubDate>Sat, 21 Mar 2026 12:20:48 -0700</pubDate><guid>/posts/training-for-greatness-speed-bleu-records-and-the-multimodal-vision/</guid><description>A practical deep-dive into how the original Transformer model shattered translation benchmarks, slashed training costs, and laid the architectural foundation for every major LLM that followed.</description></item><item><title>Inside the Machine: Encoders, Decoders, and Masking</title><link>/posts/inside-the-machine-encoders-decoders-and-masking/</link><pubDate>Sat, 21 Mar 2026 12:19:38 -0700</pubDate><guid>/posts/inside-the-machine-encoders-decoders-and-masking/</guid><description>A practical deep-dive into how the Transformer&amp;rsquo;s encoder and decoder stacks work, covering residual connections, positional encoding, masked self-attention, and cross-attention with code examples throughout.</description></item><item><title>The End of the RNN Era &amp; The Query, Key, Value Revolution</title><link>/posts/the-end-of-the-rnn-era-the-query-key-value-revolution/</link><pubDate>Sat, 21 Mar 2026 12:06:03 -0700</pubDate><guid>/posts/the-end-of-the-rnn-era-the-query-key-value-revolution/</guid><description>A practical walkthrough of why RNNs hit a fundamental wall with sequential processing and long-range dependencies, and how the Query-Key-Value attention mechanism solves both problems in one elegant step.</description></item></channel></rss>