- Updated: November 5, 2025
- 3 min read
Anyscale and Novasky Team Release SkyRL‑TX v0.1.0 – Tinker‑Compatible RL Engine for Local GPU Clusters
SkyRL-TX v0.1.0: A Game-Changer in On-Prem Reinforcement Learning Engines
Introduction
In the rapidly evolving landscape of artificial intelligence, the demand for robust on-prem reinforcement learning (RL) engines is surging. Enter SkyRL-TX v0.1.0, a groundbreaking release that promises to redefine how developers approach RL on their own infrastructure. This new version is not just an upgrade; it’s a transformation that bridges the gap between cutting-edge technology and practical application.
Overview of SkyRL-TX v0.1.0
SkyRL-TX v0.1.0 is engineered to be Tinker compatible, providing a seamless transition for developers familiar with Tinker’s API. It integrates advanced features such as LoRA adapters, gradient checkpointing, and micro-batching, all designed to optimize performance and efficiency. This release is a testament to the collaborative efforts of Anyscale and NovaSky, aiming to deliver a unified training and inference engine that supports end-to-end reinforcement learning.
Technical Highlights
The latest release of SkyRL-TX introduces several technical enhancements:
- Tinker Compatibility: SkyRL-TX maintains the Tinker programming model, allowing users to run a Tinker-like service on their local hardware without the need for a hosted environment.
- Faster Jitted Sampling: Sampling has been significantly accelerated, thanks to jitted and properly batched processes.
- LoRA Adapters: The engine supports multiple LoRA adapters, enhancing model flexibility and adaptability.
- Gradient Checkpointing: This feature ensures efficient memory usage during training, allowing for larger models and more complex computations.
- Micro-Batching: Micro-batching optimizes resource utilization, enabling smoother and more efficient training sessions.
- PostgreSQL Integration: SkyRL-TX now supports PostgreSQL as a database backend, alongside SQLite, providing robust data management capabilities.
Benefits for Users
SkyRL-TX v0.1.0 offers substantial benefits for AI researchers, machine learning engineers, and data scientists:
- Performance Gains: The integration of advanced features results in significant performance improvements, reducing training time and enhancing model accuracy.
- Ease of Deployment: The engine’s compatibility with local GPU clusters simplifies deployment, making it accessible for teams without extensive infrastructure.
- Scalability: With support for multiple LoRA adapters and micro-batching, SkyRL-TX is designed to scale with the needs of growing projects.
How to Get Started
Getting started with SkyRL-TX v0.1.0 is straightforward. Follow these steps to install and begin using the engine:
- Clone the SkyRL repository from GitHub to your local machine.
- Navigate to the SkyRL-TX directory and initiate the engine using the following command:
- Download the Tinker Cookbook from the Thinking Machines team.
- Execute the RL loop using the command:
uv run --extra gpu --extra tinker -m tx.tinker.api \
--base-model Qwen/Qwen3-4B \
--max-lora-adapters 3 \
--max-lora-rank 1 \
--tensor-parallel-size 8 \
--train-micro-batch-size 8 > out.log
export TINKER_API_KEY=dummy export WANDB_API_KEY=<your key>
uv run --with wandb --with tinker rl_loop.py \
base_url=http://localhost:8000 \
model_name="Qwen/Qwen3-4B" \
lora_rank=1 \
max_length=1024 \
save_every=100
These steps will help you set up the engine and start leveraging its powerful features for your RL projects.
Internal Resources
For those interested in exploring related technologies and integrations, UBOS offers a wealth of resources:
- Telegram integration on UBOS for seamless communication solutions.
- OpenAI ChatGPT integration to enhance your AI capabilities.
- Chroma DB integration for robust database solutions.
- UBOS platform overview to understand the full capabilities of UBOS.
- Workflow automation studio for streamlining processes.
Conclusion
SkyRL-TX v0.1.0 is not just a release; it’s a revolution in reinforcement learning. Its compatibility with Tinker, coupled with advanced features like faster jitted sampling and PostgreSQL integration, make it an indispensable tool for AI professionals. We encourage you to explore the possibilities with SkyRL-TX and leverage its capabilities to push the boundaries of what’s possible in AI model training.
Visit the UBOS homepage to learn more about our offerings and how we can support your AI journey.