genai

36 articles

Title	Date
LLM Chatbot Intent Classification: The Label Isn't Enough	Jul 22, 2026
LLM Chatbot Intent Classification: The Label Isn't Enough	Jul 22, 2026
BM25 vs Dense Retrieval for RAG: What Actually Breaks in Production	Jan 17, 2026
Stop Pasting Screenshots: How AI Engineers Document Systems with Mermaid	Dec 30, 2025
Choosing the Right LLM Inference Framework: A Practical Guide	Dec 24, 2025
How Google's SynthID Actually Works: A Visual Breakdown	Dec 16, 2025
The Tyranny of the Mean: Population-Based Optimization in Healthcare and AI	Dec 8, 2025
The Splintered Web: India 2025	Dec 7, 2025
The AI Ouroboros: How Gen AI is Eating Its Own Tail	Dec 5, 2025
Building Agents That Remember: State Management in Multi-Agent AI Systems	Nov 30, 2025
Building Production-Ready Agentic AI: The Infrastructure Nobody Talks About	Nov 27, 2025
Introducing My New Book: The ChatML (Chat Markup Language) Handbook	Nov 17, 2025
A Deep Dive into Cross Encoders and How they work	Oct 2, 2025
When Models Stand Between Us and the Web: The Future of the Internet in the Age of Generative AI	Sep 26, 2025
Cursor AI Code Editor: Boost Developer Productivity with MCP Servers	Sep 18, 2025
Provenance in AI: Auto-Capturing Provenance with MLflow and W3C PROV-O in PyTorch Pipelines – Part 4	Aug 29, 2025
Provenance in AI: Tracking AI Lineage with Signed Provenance Logs in Python - Part 2	Aug 28, 2025
Provenance in AI: Building a Provenance Graph with Neo4j – Part 3	Aug 28, 2025
Navigating AI Risks with NIST’s AI Risk Management Framework (AI RMF)	Aug 28, 2025
Provenance in AI: Why It Matters for AI Engineers - Part 1	Aug 27, 2025
LLMs for SMEs - 001: How Small Businesses Can Leverage AI Without Cloud Costs	Aug 22, 2025
Reranking for RAG: Boosting Answer Quality in Retrieval-Augmented Generation	Aug 11, 2025
ChatML Guide: Master Structured Prompts for LLMs	Aug 10, 2025
Question Answer Chatbot using RAG, Llama and Qdrant	May 19, 2025
On Emergent Abilities of Large Language Models	Mar 26, 2025
Prompt Engineering Deep Dive: Parameters, Chains, Reasoning, and Guardrails	Mar 9, 2025
LLM Text Clustering and Topic Modeling: HDBSCAN and BERTopic Tutorial	Mar 4, 2025
Text Classification using Large Language Models (LLMs)	Mar 3, 2025
Inside the LLM Inference Engine: Architecture, Optimizations, Tools, Key Concepts and Best Practices	Feb 9, 2025
Fact-Checking in LLM Systems: From Hallucinations to Verifiable AI	Feb 5, 2025
Summary of the paper DeepSeek-R1	Jan 30, 2025
How do you choose among competing open-source products? Example comparison of open-source vector databases.	Jan 28, 2025
Hands-on Tutorial on Making an Audio Bot using LLM, and RAG	Jan 27, 2025
My notes on AI-Generated Content (AIGC)	Jan 7, 2025
Making a talking bot using Llama3.2:1b running on Raspberry Pi 4 Model-B 4GB	Jan 2, 2025
Install, run, and access Llama using Ollama	Dec 17, 2024

Books by Ranjan Kumar

Books by Ranjan Kumar

Building Real-World Agentic AI Systems

The ChatML Handbook

The Chat Templates Handbook