Agentic solutions propose an improvement over traditional Large Language Model (LLM) usage by employing networks of Small Language Models (SLMs). These systems aim to strike a balance between scalability, control, and performance, addressing specific tasks with precision while maintaining overall system adaptability.
Ideas from MLOPs talk by MaltedAI.
Agentic solutions represent a pragmatic approach to AI systems by focusing on modularity, task-specific efficiency, and the thoughtful integration of human expertise. These architectures show promise for enhancing scalability, control, and adaptability in real-world applications.
Contrasting SLMs and LLMs
SLM (Small Language Models): - Intent-based conversations and decision trees. - Controlled systems, harder to build features but easier to execute. - Task-specific and efficient in offline environments.
LLMs (Large Language Models): - Flexible and natural in handling diverse queries. - Suitable for general-purpose scenarios and exploratory tasks. - High computational and scaling costs.
Combined Approach:
- Use SLM for inference and LLMs for training.
- Shift the focus from making models larger to solving real-world problems effectively.
Key Concepts in Agentic Solutions
-
Neural Decision Parser:
- Acts as the “brain” of the system, determining the appropriate action given user input.
- SLMs interpret user utterances to express code aligned with system intent.
-
Phased Policy:
- Distinguishes between contextual and general-purpose questions.
- Ensures deliberate task execution in stages for clarity and efficiency.
-
Knowledge Graphs and Interaction Models:
- Complex graph structures enable intelligent conversations between models.
- RAG setups leverage teacher-student frameworks for effective task distribution.
-
Distillation Networks of SLMs:
- SMEs (Subject Matter Experts) guide teacher models that distill their expertise into student SLMs.
- Enhances task scalability while controlling costs.
-
Scaling with Distillation:
- Leverage teacher-student frameworks for high-quality, scalable data.
- Allow teacher models to handle hard-to-scale aspects.
-
Knowledge Discovery:
- Extract SME knowledge effectively while filtering irrelevant data.
- Build high-quality datasets for task-specific applications.
Applications of SLM Networks
-
Task-Specific Systems:
- Offline processing, task search, and targeted QA.
- Optimized embedding spaces for domain-specific applications.
-
Swarm Intelligence:
- Decision-making through deliberation among SLMs.
- Aggregates diverse inputs (HR, tech, CEO) for robust conclusions.
-
Business Process Models:
- Search and page ranking systems.
- Smaller, focused systems tailored to specific business needs.
Designing Agentic Solutions
-
Role of SMEs:
- Define tasks and input structures.
- Guide model development with domain knowledge.
-
Data Preparation:
- Comprehensive sampling of the problem space ensures generalization.
- Data variability is critical for robust models.
-
Evaluation and Responsiveness:
- Measure system performance to enable continuous improvement.
- Focus on responsive, real-time processing.
-
Tool Integration:
- Use LLMs with Python engines or computational tools like Wolfram for data analysis and complex interactions.
Advantages of SLM Networks
- Precision: Models perform only what they are designed for.
- Efficiency: Smaller models are scalable and cost-effective.
- Focused Applications: Avoids the complexity of embedding spaces for entire businesses.