Important Change: Starting October 8, 2025, all new Thunder Compute instances use ephemeral storage. This means instances can only be created and deleted, not started and stopped. All data is lost when an instance is deleted.
What is Ephemeral Storage?
Ephemeral storage means that your instance’s disk is temporary and exists only for the lifetime of that instance. When you delete an instance, all data on it is permanently removed. This change enables several important benefits:- Better GPU availability and lower pricing
- Access to new GPU types, including H100s and 8-GPU nodes
- Faster instance creation and deletion
Recommended Backup Solutions
With ephemeral storage, you’ll save money and avoid surprise bills from forgotten instances. Here’s how to manage your data effectively:1. Use GitHub for Your Code and Configuration (Recommended)
GitHub should be your primary backup solution for:- Code and scripts
- Configuration files
- Requirements and dependencies
- Jupyter notebooks
- Documentation
2. For Large Files, Choose What Works Best for You
For datasets, models, and checkpoints, you have two good options:Option A: Download to Your Local Computer
The simplest approach - just download large files to your local machine when you’re done with them. This is:- Free - no storage costs
- Fast - direct download/upload when you need it
- Simple - no additional services to set up
Option B: Use Cloud Object Storage
If local storage isn’t practical, cloud services are much cheaper than our legacy persistent storage:- Cloudflare R2 - S3-compatible storage with zero egress fees (10GB free)
- Google Drive - Simple and familiar interface (15GB free)
These cloud storage options cost significantly less than Thunder Compute’s legacy persistent storage and prevent you from accidentally leaving instances running and getting surprise bills.
Setting Up Backups
Using GitHub
For your code and configuration:Downloading Files to Your Local Computer
The simplest way to preserve your data is to download it directly to your local machine usingscp
, or by dragging and dropping in VSCode. With scp
:
Make sure you’ve connected to your instance with
tnr connect
first. This sets up the tnr-0
SSH alias.Best Practices
- Commit frequently - Push your code changes to GitHub regularly, especially before deleting an instance.
- Download important results - When you complete a training run or generate important outputs, download them to your local machine or upload to cloud storage right away.
- Separate data from code - Keep your code in GitHub and large datasets either on your local machine or in cloud storage (R2/Drive).
- Save checkpoints during long runs - For multi-day training jobs, periodically download checkpoints or upload them to cloud storage.
-
Use automation - Create scripts that automatically save your outputs:
-
Create setup scripts - Document your environment setup in a script that can quickly recreate your environment on a new instance:
Accessing Data from Old Instances
If you have existing instances with data you need to retrieve:- Change your instance type to a T4 (no GPU) to reduce costs
- Download your data using one of the backup methods above
- You have 30 days from October 8, 2025 to retrieve your data
After 30 days, data on old instances will be permanently deleted. Make sure to back up anything important before the deadline.
Need Help?
If you run into any issues setting up your backup workflow or have questions about ephemeral storage:- Join our Discord community
- Email us at [email protected]