Below you will find pages that utilize the taxonomy term “Ai”
Technical Posts
Retrieval Pipelines, Re-Ranking, and Grounding: Building Production RAG
A practical guide for software engineers on building production-grade RAG systems using hybrid retrieval, re-ranking, and grounding techniques to reduce hallucinations and improve answer quality.
read moreTechnical Posts
Vector Embeddings & Similarity: The Foundation of RAG
A practical deep-dive into vector embeddings and cosine similarity — the mathematical foundation that makes retrieval in RAG systems actually work.
read moreTechnical Posts
Vector Databases, ANN, and Chunking: Storing Knowledge for Retrieval
A practical guide for software engineers covering how vector databases use Approximate Nearest Neighbor algorithms to search millions of embeddings efficiently, and how to chunk documents intelligently so your RAG pipeline actually retrieves useful, precise context.
read moreTechnical Posts
Page-Aware AI Chat: Floating Widget and Per-Page Context
A practical walkthrough of adding per-page context awareness to a floating AI chat widget built with Hugo and Netlify Functions, covering layout overrides, slug injection, priority chunk labeling, and the prompt engineering fix that made summarise-this-post actually work.
read moreTechnical Posts
Building an AI Chat Assistant for a Static Blog — No Vector DB Required
A practical walkthrough of building a conversational AI assistant for a Hugo static site using TF-IDF retrieval over a flat JSON knowledge base — no vector database, no backend server, no embeddings infrastructure required.
read moreTechnical Posts
Understanding Neural Networks: Weights, Biases, and Activations
This article breaks down the key mathematical concepts behind neural networks, including weights, biases, and activations, with an example of handwritten digit recognition.
read moreTechnical Posts
Sub-Word Tokenization: Breaking Words Like a Pro
Take a detour before diving into transformers and explore sub-word tokenization techniques like Byte-Pair Encoding, WordPiece, and Unigram models. Learn how they handle rare words, reduce vocabulary size, and make models more efficient!
read moreTechnical Posts
N-Grams Uncovered: A Key Component of Large Language Models
Decoding N-Grams: The Heart of Large Language Models
read moreTechnical Posts
Beginner’s Guide to AI: Diving Into My First AI Blog Post
Why am I writing these AI blogs? Discover my journey into AI and LLMs!
read more