Ranjan Kumar
Home
Blog
Books
My Tools
Contact
← All articles
genai
38 articles
Title
Date
BM25 vs Dense Retrieval for RAG: What Actually Breaks in Production
Jan 17, 2026
Stop Pasting Screenshots: How AI Engineers Document Systems with Mermaid
Dec 30, 2025
Building Production-Ready AI Agents with LangGraph: A Developer's Guide to Deterministic Workflows
Dec 29, 2025
Choosing the Right LLM Inference Framework: A Practical Guide
Dec 24, 2025
Agent Building Blocks: Build Production-Ready AI Agents with LangChain | Complete Developer Guide
Dec 22, 2025
When Your Chatbot Needs to Actually Do Something: Understanding AI Agents
Dec 19, 2025
How Google's SynthID Actually Works: A Visual Breakdown
Dec 16, 2025
The Tyranny of the Mean: Population-Based Optimization in Healthcare and AI
Dec 8, 2025
The Splintered Web: India 2025
Dec 7, 2025
The AI Ouroboros: How Gen AI is Eating Its Own Tail
Dec 5, 2025
Building Agents That Remember: State Management in Multi-Agent AI Systems
Nov 30, 2025
Building Production-Ready Agentic AI: The Infrastructure Nobody Talks About
Nov 27, 2025
Introducing My New Book: The ChatML (Chat Markup Language) Handbook
Nov 17, 2025
A Deep Dive into Cross Encoders and How they work
Oct 2, 2025
When Models Stand Between Us and the Web: The Future of the Internet in the Age of Generative AI
Sep 26, 2025
Cursor AI Code Editor: Boost Developer Productivity with MCP Servers
Sep 18, 2025
Provenance in AI: Auto-Capturing Provenance with MLflow and W3C PROV-O in PyTorch Pipelines – Part 4
Aug 29, 2025
Provenance in AI: Tracking AI Lineage with Signed Provenance Logs in Python - Part 2
Aug 28, 2025
Provenance in AI: Building a Provenance Graph with Neo4j – Part 3
Aug 28, 2025
Navigating AI Risks with NIST’s AI Risk Management Framework (AI RMF)
Aug 28, 2025
Provenance in AI: Why It Matters for AI Engineers - Part 1
Aug 27, 2025
LLMs for SMEs - 001: How Small Businesses Can Leverage AI Without Cloud Costs
Aug 22, 2025
LLM-Powered Chatbots: A Practical Guide to User Input Classification and Intent Handling
Aug 12, 2025
Reranking for RAG: Boosting Answer Quality in Retrieval-Augmented Generation
Aug 11, 2025
ChatML Guide: Master Structured Prompts for LLMs
Aug 10, 2025
Question Answer Chatbot using RAG, Llama and Qdrant
May 19, 2025
On Emergent Abilities of Large Language Models
Mar 26, 2025
Prompt Engineering Deep Dive: Parameters, Chains, Reasoning, and Guardrails
Mar 9, 2025
LLM Text Clustering and Topic Modeling: HDBSCAN and BERTopic Tutorial
Mar 4, 2025
Text Classification using Large Language Models (LLMs)
Mar 3, 2025
Inside the LLM Inference Engine: Architecture, Optimizations, Tools, Key Concepts and Best Practices
Feb 9, 2025
Fact-Checking in LLM Systems: From Hallucinations to Verifiable AI
Feb 5, 2025
Summary of the paper DeepSeek-R1
Jan 30, 2025
How do you choose among competing open-source products? Example comparison of open-source vector databases.
Jan 28, 2025
Hands-on Tutorial on Making an Audio Bot using LLM, and RAG
Jan 27, 2025
My notes on AI-Generated Content (AIGC)
Jan 7, 2025
Making a talking bot using Llama3.2:1b running on Raspberry Pi 4 Model-B 4GB
Jan 2, 2025
Install, run, and access Llama using Ollama
Dec 17, 2024