2. Agent fundamentals#
In section 1 of this guide, we introduced agents and RAG, explained their functionality at a high level, and highlighted their advantages over standalone LLMs.
This section will introduce the key components and principles behind developing agent applications over unstructured data. In particular, we will discuss:
Data pipeline: Transforming unstructured documents, such as collections of PDFs, into a format suitable for retrieval using the agent application’s data pipeline.
Retrieval, Augmentation, and Generation (RAG agent): An agent is called to:
Understand the user’s question
Retrieve the supporting data
Call an LLM to generate a response based on the user’s question and supporting data
Evaluation: Assessing the agent application to determine its quality/cost/latency to ensure it meets your business requirements.
Governance & LLMOps: Tracking and managing the lifecycle of each component, including data lineage and governance (access controls).