Module 6 Lesson 2: Embedding Providers (Cloud vs. Local)
Choosing your engine. Comparing OpenAI cloud embeddings with local HuggingFace models for speed and privacy.
Choosing your engine. Comparing OpenAI cloud embeddings with local HuggingFace models for speed and privacy.
The Semantic Database. How to store thousands of vectors so you can search them in milliseconds.
Fine-Tuning Retrieval. Learning how to control how many results (k) your vector store returns and what 'Score' means.
Hands-on: Build a local knowledge base using ChromaDB and perform semantic queries.
Fighting Hallucinations. Understanding the architectural pattern of grounding AI responses in factual, retrieved context.
The Search Object. How LangChain standardizes vector store lookups into a 'Retriever' that can be used in any chain.
Piping Facts. Putting it all together into a single LCEL chain that retrieves context and generates an answer.
The Art of Grounding. How to write the perfect system prompt to ensure your AI stays factual and cites its sources.
Hands-on: Finalize your first production-ready RAG system over your own local documents.
Breaking the Amnesia. Understanding why LLMs are stateless and how we provide 'history' to simulate a conversation.
The Raw Transcript. Using the simplest memory type to keep a literal record of every message in a conversation.
Dense context. How to use an LLM to periodically summarize a conversation to keep the memory footprint small.
Production State. How to move your memory from local RAM to persistent databases for multi-user applications.
Hands-on: Build a persistent chatbot that remembers your name across different CLI sessions.
The Agent's Hands. Understanding how to give an LLM the ability to execute code and interact with the physical world.
Creating Superpowers. How to turn any Python function into a LangChain tool using a simple decorator.
Instant Capabilities. Exploring the library of pre-made tools for web search, calculation, and database interaction.
Hands-on: Build a toolbox for an agent that can multiply numbers and search Wikipedia.