LangSmith is a new platform designed to debug, test, and monitor LLM applications, bridging the gap between prototypes and production.
The platform provides a user interface for creating and managing projects and datasets, though most tasks are more efficient to do in code.
To get started, users need to create a project, retrieve an API key, and set up environment variables. The API key authorizes tracing of your LLM code, so it should be handled as a secret.
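
The setup step above can be sketched as follows. This is a minimal sketch, assuming the environment variable names that LangChain read for LangSmith tracing at the time (LANGCHAIN_TRACING_V2, LANGCHAIN_ENDPOINT, LANGCHAIN_API_KEY, LANGCHAIN_PROJECT). The helper configure_langsmith and the project name are hypothetical and do not come from the video.

```python
import os


def configure_langsmith(project_name: str, api_key: str) -> None:
    """Point LangChain tracing at a LangSmith project (hypothetical helper)."""
    os.environ["LANGCHAIN_TRACING_V2"] = "true"
    os.environ["LANGCHAIN_ENDPOINT"] = "https://api.smith.langchain.com"
    os.environ["LANGCHAIN_PROJECT"] = project_name
    # Treat the key as a secret: read it from a .env file or a secrets
    # store rather than hard-coding it in source control.
    os.environ["LANGCHAIN_API_KEY"] = api_key


if __name__ == "__main__":
    # "demo-project" is a placeholder project name.
    configure_langsmith("demo-project", os.environ.get("LANGCHAIN_API_KEY", ""))
    print(os.environ["LANGCHAIN_PROJECT"])
```

Setting the variables before any chain is constructed keeps the tracing configuration in one place.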
To bring LLM systems to production, we need to install LangSmith, LangChain, and the other required Python packages.
By loading environment variables and running a chain, we can get the output from OpenAI.
For more control, we can use the LangSmith and LangChain libraries directly and create tracer instances.
Tagging LLM calls allows for filtering and organizing the different steps of an LLM chain.
Different LLM calls can be grouped using the trace_as_chain_group function.
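
Tagging and grouping might look like the sketch below. It assumes the 2023-era LangChain import paths (langchain.chat_models, langchain.chains, langchain.callbacks.manager) and an OpenAI key in the environment. The tag names, group name, and prompts are hypothetical. Imports are kept inside the function so the file loads even when the packages are not installed.

```python
# Hypothetical tag names used to label each step of the chain.
STEP_TAGS = {"draft": ["draft-step"], "refine": ["refine-step"]}


def run_tagged_and_grouped(question: str) -> str:
    """Run two tagged LLM steps under a single trace group."""
    from langchain.callbacks.manager import trace_as_chain_group
    from langchain.chains import LLMChain
    from langchain.chat_models import ChatOpenAI
    from langchain.prompts import PromptTemplate

    llm = ChatOpenAI(temperature=0)
    draft = LLMChain(
        llm=llm,
        prompt=PromptTemplate.from_template("Answer the question: {question}"),
        tags=STEP_TAGS["draft"],
    )
    refine = LLMChain(
        llm=llm,
        prompt=PromptTemplate.from_template("Shorten this answer: {text}"),
        tags=STEP_TAGS["refine"],
    )

    # Both runs are recorded as children of one group in the project UI.
    with trace_as_chain_group("answer-and-refine") as group_manager:
        first = draft.run(question=question, callbacks=group_manager)
        return refine.run(text=first, callbacks=group_manager)
```

Passing the group manager as callbacks is what attaches each call to the group. The tags remain available for filtering afterwards.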
The project UI provides a visual representation of the tagged and grouped LLM calls.
You can organize your LLM runs using tags, or list them from code.
You can filter LLM runs based on the start time, run type, or metadata.
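
Filtering runs from code can be sketched as below. It assumes langsmith's Client.list_runs accepts project_name, start_time, and run_type keyword arguments. The helper names are hypothetical. Metadata filtering goes through the client's filter string argument, whose syntax is not shown in the source and is left out here.

```python
from datetime import datetime, timedelta, timezone
from typing import Any, Dict, Optional


def build_run_query(
    project_name: str,
    hours_back: int = 24,
    run_type: Optional[str] = "llm",
) -> Dict[str, Any]:
    """Build keyword arguments for Client.list_runs (hypothetical helper)."""
    query: Dict[str, Any] = {
        "project_name": project_name,
        # Only runs that started within the last `hours_back` hours.
        "start_time": datetime.now(timezone.utc) - timedelta(hours=hours_back),
    }
    if run_type is not None:
        query["run_type"] = run_type
    return query


def list_recent_runs(project_name: str) -> list:
    """Fetch matching runs; requires langsmith and a configured API key."""
    from langsmith import Client

    client = Client()
    return list(client.list_runs(**build_run_query(project_name)))
```

Keeping the query construction separate from the network call makes the filter logic easy to check without touching the API.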
You can create a dataset and use it to evaluate the quality of an LLM.
The tutorial explains how to create a dataset and upload it to LangSmith.
Different data formats, such as tuples and CSV files, can be used as the source for a dataset in LangSmith.
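
Building a dataset from tuples or from a CSV file might look like this. It is a sketch that assumes langsmith's Client exposes create_dataset, create_example, and upload_csv with the keyword arguments shown. The example pairs, the "question" and "answer" key names, and the helper names are hypothetical.

```python
from typing import Dict, Iterable, List, Tuple

# Hypothetical question/answer pairs used as evaluation examples.
QA_PAIRS: List[Tuple[str, str]] = [
    ("What is 2 + 2?", "4"),
    ("What is the capital of France?", "Paris"),
]


def pairs_to_examples(
    pairs: Iterable[Tuple[str, str]],
) -> List[Tuple[Dict[str, str], Dict[str, str]]]:
    """Convert (question, answer) tuples into (inputs, outputs) dicts."""
    return [({"question": q}, {"answer": a}) for q, a in pairs]


def upload_pairs(dataset_name: str, pairs: Iterable[Tuple[str, str]]) -> None:
    """Create a dataset and add one example per tuple."""
    from langsmith import Client

    client = Client()
    dataset = client.create_dataset(
        dataset_name=dataset_name, description="Q&A pairs for evaluation"
    )
    for inputs, outputs in pairs_to_examples(pairs):
        client.create_example(inputs=inputs, outputs=outputs, dataset_id=dataset.id)


def upload_csv_file(csv_path: str, dataset_name: str) -> None:
    """Upload a CSV whose columns are named 'question' and 'answer'."""
    from langsmith import Client

    Client().upload_csv(
        csv_file=csv_path,
        input_keys=["question"],
        output_keys=["answer"],
        name=dataset_name,
        description="Q&A pairs loaded from CSV",
    )
```

Either path produces the same shape of example: a dict of inputs and a dict of reference outputs.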
The tutorial also covers evaluating LLMs using RunEvalConfig and the run_on_dataset function.
The video explains how to bring LLM systems to production.
The process involves creating a client, calling run_on_dataset, and passing it an evaluation configuration.
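
The three steps above can be sketched as follows. It assumes the 2023-era langchain.smith module, which exported RunEvalConfig and run_on_dataset, and a dataset whose examples have a "question" input. The chain, prompt, and helper name are hypothetical.

```python
# Built-in evaluator names to apply; "qa" grades answers against the reference.
EVALUATOR_NAMES = ["qa"]


def evaluate_on_dataset(dataset_name: str):
    """Run a chain over every example in a dataset and grade the outputs."""
    from langchain.chains import LLMChain
    from langchain.chat_models import ChatOpenAI
    from langchain.prompts import PromptTemplate
    from langchain.smith import RunEvalConfig, run_on_dataset
    from langsmith import Client

    def chain_factory():
        # A fresh chain per example avoids sharing state between runs.
        return LLMChain(
            llm=ChatOpenAI(temperature=0),
            prompt=PromptTemplate.from_template("{question}"),
        )

    eval_config = RunEvalConfig(evaluators=EVALUATOR_NAMES)
    return run_on_dataset(
        client=Client(),
        dataset_name=dataset_name,
        llm_or_chain_factory=chain_factory,
        evaluation=eval_config,
    )
```

The results appear as a new test project in the LangSmith UI, alongside the evaluator feedback for each run.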
A custom prompt template can be created that takes the query, the reference answer, and the predicted result.
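
A custom grading prompt might look like the sketch below. It assumes the QA evaluator expects exactly the variables query, answer, and result, and that RunEvalConfig.QA accepts a prompt argument. The template wording and helper names are hypothetical.

```python
# Hypothetical grading template; the three placeholders are the variables
# the QA evaluator is assumed to fill in.
EVAL_TEMPLATE = (
    "You are grading a student's answer.\n"
    "QUESTION: {query}\n"
    "TRUE ANSWER: {answer}\n"
    "STUDENT ANSWER: {result}\n"
    "Reply with CORRECT or INCORRECT.\n"
    "GRADE:"
)


def render_eval_prompt(query: str, answer: str, result: str) -> str:
    """Fill the template with plain string formatting, for a quick preview."""
    return EVAL_TEMPLATE.format(query=query, answer=answer, result=result)


def build_qa_eval_config():
    """Wrap the template in a PromptTemplate and attach it to the QA evaluator."""
    from langchain.prompts import PromptTemplate
    from langchain.smith import RunEvalConfig

    prompt = PromptTemplate(
        input_variables=["query", "answer", "result"],
        template=EVAL_TEMPLATE,
    )
    return RunEvalConfig(evaluators=[RunEvalConfig.QA(prompt=prompt)])
```

Previewing the rendered prompt with render_eval_prompt is a cheap way to check the wording before spending tokens on an evaluation run.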
The tutorial discusses the usage of the ContextQA and CoTQA (chain-of-thought QA) evaluator classes in the LangSmith system.
The QA class takes an evaluator type, with llm set to None and prompt left at its default.
The tutorial demonstrates running the system on a dataset and viewing the output in the UI.