Working with LangChain document loaders
LangChain is a framework that enables the development of applications powered by language models. It provides several core modules, including Models, Prompts, Memory, Indexes, Chains, and Agents. LangChain's premise is that the most powerful and differentiated applications will not only call out to a language model via an API but will also be data-aware and allow the language model to interact with its environment.
What are document loaders in LangChain?
Document loaders in LangChain ingest data from a source into documents that chains can work with. Together with text splitters and chain strategies such as stuffing, map-reduce, refine, and map-rerank, they are used to get around the ~4K token limit of OpenAI's models: documents are split into smaller sections or summaries that fit within the limit, and those partial results are then consolidated into a single summary that also fits. The refine strategy instead iteratively updates the prompt with each section's context before composing a final summary that primes the conversation for the response.
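For example, a map-reduce summarization chain summarizes each split section independently (the "map" step) and then consolidates the partial summaries into one final summary (the "reduce" step). A minimal sketch, assuming an OPENAI_API_KEY is set and that docs is a list of already-split documents:

from langchain.llms import OpenAI
from langchain.chains.summarize import load_summarize_chain

llm = OpenAI(temperature=0)
# chain_type can be "stuff", "map_reduce", or "refine";
# "map_rerank" is available for question answering chains
chain = load_summarize_chain(llm, chain_type="map_reduce")
summary = chain.run(docs)  # summarize each chunk, then merge the summaries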
These documents are embedded and indexed as vectors in a vector database. Vector embeddings capture semantic meaning and represent it in a continuous space, which helps the model understand context and generate more accurate responses.
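As a quick illustration of what gets stored (again assuming an OPENAI_API_KEY is set), embedding a piece of text yields a long vector of floats, and texts with similar meanings map to nearby vectors:

from langchain.embeddings import OpenAIEmbeddings

embeddings = OpenAIEmbeddings()
vector = embeddings.embed_query("red alert on Discord")
print(len(vector))  # 1536 dimensions with OpenAI's default ada-002 embeddings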
Question answering systems
To do question answering over our data, we follow four steps with LangChain (a condensed sketch follows the list):
- Create an index
- Create a Retriever from that index
- Create a question answering chain
- Ask questions!
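Before walking through the full example, here is what those four steps can look like when condensed with VectorstoreIndexCreator, which bundles loading, splitting, embedding, and storing into a single call. This is a minimal sketch, assuming an OPENAI_API_KEY is set and using the same CSV file as the example below:

from langchain.document_loaders.csv_loader import CSVLoader
from langchain.indexes import VectorstoreIndexCreator

loader = CSVLoader(file_path='./example_data/red_alert.csv')
# from_loaders loads the CSV, splits it, embeds the chunks, and stores them
index = VectorstoreIndexCreator().from_loaders([loader])
# query() wraps steps 2-4: retriever, QA chain, and the question itself
index.query('Who wrote the article on "i18n Frontend Guideline"?')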
Example agent with CSVs
Below is an example using a CSV file through an agent. This CSV contains red-alert channel data from Dwarves' Discord as the context for the questions we will be asking later.
- First, we install all of our pip packages:
pip install langchain openai huggingface_hub gitpython chromadb tiktoken google-api-python-client llama-cpp-python
- Then, with our red_alert.csv file, we create a loader for it in LangChain and create a reference object by loading it.
from langchain.document_loaders.csv_loader import CSVLoader
red_alert_csv_loader = CSVLoader(file_path='./example_data/red_alert.csv')
red_alert_docs = red_alert_csv_loader.load()
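Each row of the CSV becomes its own document, with the column values in page_content and the source file plus row number in metadata, which we can verify with a quick peek:

print(red_alert_docs[0].page_content)
print(red_alert_docs[0].metadata)  # e.g. {'source': './example_data/red_alert.csv', 'row': 0}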
- We then split our text and instantiate our embeddings and our LLM. In our case, we will use OpenAI for both.
from langchain.chains import RetrievalQA
from langchain.llms import OpenAI
from langchain.text_splitter import CharacterTextSplitter
from langchain.embeddings import OpenAIEmbeddings
from langchain.vectorstores import Chroma
text_splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=0)
red_alert_texts = text_splitter.split_documents(red_alert_docs)
embeddings = OpenAIEmbeddings()
openai_llm = OpenAI(temperature=0)  # the LLM used by the QA chain and agent below
- We store our vector embeddings, computed by OpenAI, in ChromaDB and create a retriever from the resulting vector store while we're at it.
red_alert_db = Chroma.from_documents(red_alert_texts, embeddings)
red_alert_retriever = red_alert_db.as_retriever()
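The retriever can be sanity-checked on its own before wiring it into a chain; the query string here is just an illustrative example:

# fetch the chunks from Chroma that are most similar to the query
relevant = red_alert_retriever.get_relevant_documents("i18n Frontend Guideline")
print(len(relevant), relevant[0].page_content[:200])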
- We can then create our RetrievalQA system from our retriever and LLM (OpenAI).
red_alert_qa = RetrievalQA.from_chain_type(llm=openai_llm, chain_type="stuff", retriever=red_alert_retriever)
We can also query this system immediately:
query = 'Who wrote the article on "i18n Frontend Guideline"?'
red_alert_qa.run(query)
# <@874125795549909082> wrote the article on "i18n Frontend Guideline".
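The "stuff" chain type simply stuffs all retrieved chunks into a single prompt. If the retrieved context ever outgrew the model's window, we could swap in one of the strategies mentioned earlier, reusing the same retriever and LLM, for example:

# map_reduce answers over each retrieved chunk separately, then combines the answers
red_alert_qa_mr = RetrievalQA.from_chain_type(
    llm=openai_llm, chain_type="map_reduce", retriever=red_alert_retriever
)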
- To create our agent, we wrap our QA system in a list of tools and initialize our agent (zero-shot ReAct) with those tools and OpenAI. We'll also add a Google Search tool for good measure.
from langchain.agents import AgentType, initialize_agent, Tool
from langchain import GoogleSearchAPIWrapper
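# GoogleSearchAPIWrapper reads GOOGLE_API_KEY and GOOGLE_CSE_ID from the environment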
google_search = GoogleSearchAPIWrapper()
tools = [
    Tool(
        name="Dwarves Red-alert QA System",
        func=red_alert_qa.run,
        description="useful for questions about alerts and news on the Dwarves Foundation Discord."
    ),
    Tool(
        name="Google Search",
        func=google_search.run,
        description="useful for searching the web with Google's search engine."
    )
]
openai_agent = initialize_agent(tools, openai_llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True)
- Finally, we have an agent that can answer our questions based on data from our red-alert QA system and the internet.
query = 'What Discord username wrote "i18n Frontend Guideline"? What is his age in roman numerics?'
openai_agent.run(query)
# The username is An Tran and the age in Roman numerics is MMXXI.
Conclusion
Hopefully, this gives you an idea of how LangChain can be used to create an agent that draws on indexes and data from our transient vector database. Much of the prompt-generation work happens behind the scenes, so we only need to worry about loading and consolidating the data we want to index for our LLMs.