Open In Colab This example demonstrates how to use the Predibase SDK to use Reinforcement Finetuning to train a model to play Countdown. If you run into any issues regarding the accelerator or quantization, please take a look at how to specify the compute spec here.