Fine-tune a model to play Countdown using reinforcement learning
This example demonstrates how to use the Predibase SDK to use Reinforcement
Finetuning to train a model to play
Countdown.If you run into any issues regarding the accelerator or quantization,
please take a look at how to specify the compute spec here.