MLOps for LLMs: How To Build, Tune, and Test a Chatbot Without Hating Your Life
Thursday, 16 May 2024 | 08:30 - 17:00
Description
When you work with large machine learning models such as LLMs, the development process can get messy fast. This workshop teaches MLOps practices and tools to keep your work reproducible, automatable, and testable. We'll start with the basics of building a chatbot with LLMs, then show how to take a prototype and iteratively adapt it to a real-world scenario using tools such as Data Version Control (DVC). By the end, you'll have a working pipeline that connects LLMs, a custom dataset, and crucial performance metrics.
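To make "a working pipeline" concrete: DVC describes pipelines declaratively in a `dvc.yaml` file, where each stage lists its command, dependencies, and outputs so that `dvc repro` only reruns what changed. A chatbot pipeline of the kind built in this workshop might look roughly like the sketch below (the stage, script, and file names are illustrative assumptions, not the workshop's actual layout):

```yaml
stages:
  embed:               # turn raw documents into a vector index
    cmd: python embed.py data/docs -o embeddings
    deps:
      - data/docs
      - embed.py
    outs:
      - embeddings
  answer:              # retrieve relevant chunks and query the LLM
    cmd: python answer.py embeddings questions.csv -o answers.csv
    deps:
      - embeddings
      - answer.py
      - questions.csv
    outs:
      - answers.csv
  evaluate:            # score the answers and record metrics
    cmd: python evaluate.py answers.csv -o metrics.json
    deps:
      - answers.csv
      - evaluate.py
    metrics:
      - metrics.json
```

Because each stage declares its inputs and outputs, editing only the evaluation script reruns just the `evaluate` stage, while changing the source documents reruns the whole chain.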
Topics
What is MLOps and why do we need it when developing LLMs?
How chatbots use LLMs and retrieval-augmented generation (RAG)
Embedding documents and storing them in a vector index with langchain and faiss
Tracking documents and embeddings with DVC
Retrieving relevant documents and answering questions using an LLM
Chaining the pipeline together with DVC
Measuring performance with ragas
Tracking and comparing performance with DVC
Improving performance by tuning the data, model, or retrieval
Maintaining and updating the application
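The core idea behind the retrieval step above is simple: represent each document as a vector, then answer a question by first finding the documents most similar to it. The workshop uses langchain and faiss for this; the sketch below is a deliberately tiny, dependency-free stand-in that fakes the embedding with word-count vectors, just to show the mechanics:

```python
# Minimal sketch of RAG-style retrieval. Real systems use learned dense
# embeddings (e.g. via langchain) and an ANN index (e.g. faiss); here a
# bag-of-words Counter plays the role of the embedding so the example
# runs anywhere with no extra packages.
from collections import Counter
from math import sqrt


def embed(text: str) -> Counter:
    # Toy "embedding": a bag-of-words count vector.
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[w] * b[w] for w in a)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query: str, docs: list[str], k: int = 1) -> list[str]:
    # Return the k documents most similar to the query.
    q = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]


docs = [
    "DVC tracks data files and pipeline stages in Git.",
    "FAISS indexes dense vectors for fast similarity search.",
    "Ragas scores RAG answers for faithfulness and relevance.",
]
print(retrieve("how do I search vectors quickly?", docs))
```

In a full RAG chatbot, the retrieved documents are then pasted into the LLM prompt as context, so the model answers from your data rather than from its training set alone.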
Requirements
Basic coding skills in Python
Basic familiarity with Git and working in the command line
Target audience
Anyone who wants to learn how to practically build an AI application like a chatbot, and how to incorporate MLOps tools into its development. No previous AI or ML experience is required.