Predibase supports fine-tuning LLMs for classification tasks. This adds LoRA weights and a new classification head, both of which are optimized during training. This is especially useful when you want your model to always predict from a set of predefined labels. Inference is faster, regardless of the number of tokens per label. The model never hallucinates classes. Finally, accuracy is higher than with SFT for classification.
Ideal Use Cases
Routing and Orchestration
- Training a model router.
- Product classification.
Sentiment & Feedback
- Sentiment analysis.
- Customer feedback labeling.
Safety & Guardrails
- Guardrails models.
- Toxicity/PII detection.
Quick Start
To get started with classification fine-tuning:
- Prepare a dataset with a text field and a label field.
- Kick off a training job using a ClassificationConfig.
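The dataset preparation step can be sketched as follows. This is a minimal illustration using only the Python standard library, not Predibase's own tooling: the helper name `write_classification_dataset`, the CSV output format, the file name, and the example rows are all assumptions made for this sketch. The optional `template` argument is a hypothetical stand-in for applying a prompt or chat template to the text yourself before upload; the placeholder string shown is not any real model's template.

```python
import csv
from collections import Counter


def write_classification_dataset(rows, path, allowed_labels, template=None):
    """Write (text, label) pairs to a CSV file with `text` and `label` columns.

    Hypothetical helper for illustration only. Rows with empty text are
    skipped, and any label outside `allowed_labels` raises a ValueError.
    If `template` is given, the literal "{text}" in it is replaced by the
    row's text before writing.
    """
    allowed = set(allowed_labels)
    counts = Counter()
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=["text", "label"])
        writer.writeheader()
        for text, label in rows:
            if label not in allowed:
                raise ValueError(f"unexpected label: {label!r}")
            if not text.strip():
                continue  # skip rows with no usable text
            if template is not None:
                text = template.replace("{text}", text)
            writer.writerow({"text": text, "label": label})
            counts[label] += 1
    return counts


# Made-up example rows for a sentiment task.
EXAMPLES = [
    ("The checkout page keeps crashing.", "negative"),
    ("Delivery was quick and the packaging was great.", "positive"),
    ("I received the order on Tuesday.", "neutral"),
]

counts = write_classification_dataset(
    EXAMPLES,
    "classification_train.csv",  # assumed file name
    allowed_labels=["positive", "negative", "neutral"],
)
print(dict(counts))  # per-label row counts, useful for spotting class imbalance
```

Validating labels against a fixed set before upload keeps typos such as "postive" from silently becoming an extra class in the classification head.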
For classification training, there is no option to automatically apply a chat template. If you want one, apply it to the text before uploading the dataset. Furthermore, turbo and turbo_lora are not applicable adapter types.
Next Steps
- Learn how to deploy and query your fine-tuned adapter.