In today's software development world, there are several Machine Learning (ML) tools that empower developers to create specific chatbots that cater to various needs. These chatbots harness the capabilities of Large Language Models (LLMs) trained on confidential, private document sets on your local machine or enterprise data repositories. They enable you to interact and engage with these document sets, from summarizing emails and meeting notes to gauging sentiment, extracting dates and names from websites, pre-screening resumes, and performing many other tasks.
But most of these systems are Python-based. While Python serves as a formidable language for ML, many software developers find themselves operating within the Java ecosystem, especially within large organizations that prioritize Java for robust and reliable production deployments. This presentation focuses on Java-centric tools to construct tuned assistant systems using the retrieval-augmented generation (RAG) technique.
Quick Overview of Neural Networks, Weights, and Embeddings
Exploring the Current State of LLMs
Implementing Prompts and Completions via a Java API to ChatGPT
Understanding Prompt Structure and Prompt Engineering Techniques
Learn about Instruction-Tuning and Fine-Tuning Techniques
Navigating Vector Databases and Embeddings with Java
Crafting a Private Chatbot Architecture
Demo