Advanced Best Practices

Get the most out of Google Colab with these best practices for session management, memory optimization, GPU usage, collaboration, and working within free tier limitations.

Session Management

  • Structure your notebook: Put all imports and installations at the top so they are easy to re-run after a restart
  • Save checkpoints to Drive: Save model weights and intermediate results to Google Drive during long training runs
  • Use try/except: Wrap long operations in try/except blocks to save progress if an error occurs
  • Pin important revisions: Use File → Save and pin revision before making major changes
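The checkpoint-to-Drive and try/except points above can be combined into one pattern. A minimal sketch follows; `CHECKPOINT`, the loss computation, and the epoch count are hypothetical placeholders (in a real notebook, point the path at a mounted Drive folder and replace the stand-in line with your training step):

```python
import pickle
from pathlib import Path

# Hypothetical checkpoint location; in Colab this would live under
# /content/drive/MyDrive/... after mounting Google Drive.
CHECKPOINT = Path("checkpoints/state.pkl")
CHECKPOINT.parent.mkdir(parents=True, exist_ok=True)

def save_checkpoint(state):
    """Persist training state so a disconnect doesn't lose progress."""
    with open(CHECKPOINT, "wb") as f:
        pickle.dump(state, f)

def load_checkpoint():
    """Resume from the last saved state, or start fresh."""
    if CHECKPOINT.exists():
        with open(CHECKPOINT, "rb") as f:
            return pickle.load(f)
    return {"epoch": 0, "loss_history": []}

state = load_checkpoint()
try:
    for epoch in range(state["epoch"], 10):
        loss = 1.0 / (epoch + 1)          # stand-in for a real training step
        state["epoch"] = epoch + 1
        state["loss_history"].append(loss)
        save_checkpoint(state)            # checkpoint every epoch
except Exception:
    save_checkpoint(state)                # save progress before surfacing the error
    raise
```

Because `load_checkpoint()` runs first, re-running this cell after a restart resumes from the last completed epoch instead of starting over.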

Avoiding Disconnections

Colab disconnects after ~90 minutes of inactivity or when the maximum session time is reached:

  • Keep the browser tab active: Colab detects idle tabs and may disconnect. Keep the tab in the foreground
  • Use Colab Pro for longer sessions: Pro and Pro+ offer extended session times up to 24 hours
  • Save frequently: Auto-save works, but manually save before leaving your notebook unattended
  • Background execution (Pro): Colab Pro allows notebooks to run even when the browser is closed
💡 Note: Do not use scripts or browser extensions to prevent Colab from disconnecting. This violates Google's terms of service and may result in restricted access.

Memory Optimization

```python
# Monitor RAM usage
import psutil

print(f"RAM Used: {psutil.virtual_memory().percent}%")
print(f"RAM Available: {psutil.virtual_memory().available / 1e9:.1f} GB")

# Delete unused variables and force garbage collection
# (large_dataframe is a placeholder for any large object you no longer need)
import gc
del large_dataframe
gc.collect()

# Use efficient data types
import pandas as pd

df = pd.read_csv('data.csv')
# Convert float64 columns to float32 to halve their memory footprint
for col in df.select_dtypes(include=['float64']).columns:
    df[col] = df[col].astype('float32')

# Use generators instead of loading everything into memory
def data_generator(file_path, chunk_size=10000):
    for chunk in pd.read_csv(file_path, chunksize=chunk_size):
        yield chunk
```
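The float64-to-float32 downcast above halves per-element storage. The effect can be demonstrated with only the standard-library `array` module, used here as a stand-in for a DataFrame column:

```python
from array import array

values = [float(i) for i in range(100_000)]
col64 = array("d", values)   # 8 bytes per element, like float64
col32 = array("f", values)   # 4 bytes per element, like float32

bytes64 = col64.itemsize * len(col64)
bytes32 = col32.itemsize * len(col32)
print(bytes64, bytes32)  # the float32-style column uses half the memory
```

The trade-off is precision: float32 carries roughly 7 significant decimal digits, so keep float64 for columns where small differences matter.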

GPU Best Practices

  • Only enable GPU when needed: Use CPU runtime for data preprocessing and EDA to save GPU quota
  • Use mixed precision: Train with float16 to reduce memory usage by ~50% and speed up training
  • Gradient accumulation: Simulate larger batch sizes without increasing memory by accumulating gradients
  • Clear GPU cache: Call torch.cuda.empty_cache() after deleting tensors
  • Monitor usage: Run !nvidia-smi regularly to check GPU memory consumption
  • Use gradient checkpointing: Trade computation for memory by recomputing activations during backward pass
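The gradient-accumulation idea from the list above can be illustrated framework-agnostically. This is a toy sketch with a single scalar "parameter" and made-up gradients, not real training code; in PyTorch the same pattern is `(loss / accum_steps).backward()` on every micro-batch, with `optimizer.step()` and `optimizer.zero_grad()` only every `accum_steps` batches:

```python
# Process small micro-batches but only update weights every `accum_steps`
# batches, mimicking a larger effective batch size without extra memory.
accum_steps = 4
weight = 0.0            # a single toy "parameter"
lr = 0.1
accumulated_grad = 0.0
updates = 0

micro_batch_grads = [0.2, -0.1, 0.4, 0.3, 0.1, 0.0, -0.2, 0.5]  # toy gradients

for step, grad in enumerate(micro_batch_grads, start=1):
    accumulated_grad += grad / accum_steps   # average over the accumulation window
    if step % accum_steps == 0:
        weight -= lr * accumulated_grad      # one optimizer step per window
        accumulated_grad = 0.0
        updates += 1

print(updates)  # 2 optimizer steps for 8 micro-batches
```

Dividing each gradient by `accum_steps` keeps the update equivalent to one step on the full batch, so learning-rate settings carry over unchanged.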

Collaboration Tips

  • Use consistent formatting: Follow a cell structure: imports, constants, functions, main code, results
  • Document cells: Add a text cell before each code section explaining what it does and why
  • Share with "Viewer" access: Prevent accidental modifications by sharing as "Viewer" instead of "Editor"
  • Use comments: Right-click a cell to add comments for code review discussions
  • Version control: Save to GitHub for proper version tracking beyond Colab's revision history

Free Tier Limitations

| Limitation | Details | Workaround |
|---|---|---|
| Session time | ~12 hours maximum | Save checkpoints to Drive; restart and resume |
| Idle timeout | ~90 minutes | Keep the tab active; upgrade to Pro for background execution |
| GPU availability | Not guaranteed; may get CPU only | Try again later; upgrade to Pro for priority |
| RAM | ~12 GB standard | Optimize memory; use High-RAM runtime (Pro) |
| Disk space | ~78 GB runtime storage | Use Google Drive for persistent large files |
| No persistent state | Variables and packages reset each session | Save state to Drive; re-run setup cells |
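For the "no persistent state" row, a pair of small helpers can save and restore session objects across runtime resets. This is a sketch under assumptions: `SAVE_DIR` is a made-up location, and the commented-out `drive.mount` call is what you would run first in a real Colab notebook so the files land on Drive rather than ephemeral runtime disk:

```python
import os
import pickle
from pathlib import Path

# In Colab, mount Drive first so saved files survive the session:
#   from google.colab import drive
#   drive.mount('/content/drive')
# SAVE_DIR is an assumption; point it at a Drive folder in a real notebook.
SAVE_DIR = Path(os.environ.get("SAVE_DIR", "session_state"))
SAVE_DIR.mkdir(parents=True, exist_ok=True)

def persist(name, obj):
    """Save any picklable object so it survives a runtime reset."""
    with open(SAVE_DIR / f"{name}.pkl", "wb") as f:
        pickle.dump(obj, f)

def restore(name, default=None):
    """Reload a saved object in a fresh session, or fall back to a default."""
    path = SAVE_DIR / f"{name}.pkl"
    if path.exists():
        with open(path, "rb") as f:
            return pickle.load(f)
    return default

persist("hyperparams", {"lr": 3e-4, "batch_size": 32})
restored = restore("hyperparams")
```

Packages still need to be reinstalled each session, so keep the `!pip install` lines in a setup cell at the top of the notebook alongside a call to `restore()`.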

Alternatives When Colab Is Not Enough

  • Kaggle Notebooks: Free P100/T4 GPUs with 30 hours/week — good when Colab GPU is unavailable
  • Google Cloud Vertex AI: Professional ML platform with dedicated resources and SLAs
  • AWS SageMaker Studio Lab: Free GPU notebooks from Amazon with persistent environments
  • Paperspace Gradient: Free GPU notebooks with more powerful hardware options
  • Local setup: Install Jupyter locally with Anaconda for unlimited resources using your own hardware

Frequently Asked Questions

Is Google Colab really free?

Yes, Google Colab offers a free tier with access to GPUs (T4) and TPUs, though availability is not guaranteed and there are usage limits. Paid tiers (Pro at $9.99/month and Pro+ at $49.99/month) offer priority access, faster GPUs, more RAM, and longer sessions.

Can I use Colab for production workloads?

Colab is designed for interactive development, not production. For production ML workloads, consider Google Cloud Vertex AI, AWS SageMaker, or a dedicated server. Colab Enterprise is available for organizations needing managed security and compliance.

How do I keep my Colab session from disconnecting?

Keep the browser tab active and in the foreground. On the free tier, sessions last up to ~12 hours with a ~90 minute idle timeout. Colab Pro and Pro+ offer background execution so notebooks can run with the browser closed.

Can I install any Python package in Colab?

Yes, you can install any pip-installable package using !pip install package_name. However, installed packages are lost when the runtime resets. Some system-level packages can also be installed using !apt-get install.

How do I get a better GPU in Colab?

Upgrade to Colab Pro ($9.99/month) for priority access to T4 and V100 GPUs, or Colab Pro+ ($49.99/month) for A100 access. Free tier users get whatever GPU is available, which is typically a T4 but may be unavailable during high-demand periods.

Can I use Colab offline?

No, Colab requires an internet connection to connect to Google's cloud infrastructure. For offline work, use Jupyter Notebook installed locally via Anaconda or pip.