Frequently Asked Questions¤

Common questions about Workshop and their answers.

Installation¤

Q: How do I install Workshop?¤

A: Using uv (recommended):

uv pip install workshop

For GPU support:

uv sync --extra cuda12  # For CUDA 12

See the Installation Guide for more details.

Q: Do I need a GPU?¤

A: No, Workshop works on CPU. However, training large models is much faster on GPU. Workshop automatically uses GPU if available.

Q: Which Python version should I use?¤

A: Python 3.10 or later. We test on Python 3.10, 3.11, and 3.12.

Models¤

Q: Which models does Workshop support?¤

A: Workshop supports:

VAE (Variational Autoencoders)
GAN (Generative Adversarial Networks)
Diffusion Models (DDPM, DDIM)
Flow Models (RealNVP, Glow, Continuous Normalizing Flows)

See Models Overview for details.

Q: Can I use custom architectures?¤

A: Yes! Workshop provides flexibility for custom models. See Custom Architectures.

Q: How do I load pre-trained models?¤

A: Use the checkpointing system:

from workshop.generative_models.core.checkpointing import load_checkpoint

model, step = load_checkpoint(checkpoint_manager, model_template)

Training¤

Q: How do I train a model?¤

A: Basic training example:

from workshop.generative_models.training.trainer import Trainer

trainer = Trainer(model_config=config, training_config=train_config)
trainer.train(train_dataset, val_dataset)

See Training Guide for complete examples.

Q: Training is slow. How can I speed it up?¤

A: Several options:

Use GPU: Much faster than CPU
Increase batch size: Better GPU utilization
Use mixed precision: dtype=jnp.bfloat16
JIT compilation: Happens automatically with JAX
Data parallelism: Distribute across multiple GPUs

See Distributed Training.

Q: My model's loss is NaN. What's wrong?¤

A: Common causes:

Learning rate too high: Try reducing it (e.g., 1e-4 instead of 1e-3)
Gradient explosion: Add gradient clipping
Numerical instability: Check for divisions by zero or log(0)
Bad initialization: Use proper weight initialization

Q: How do I save checkpoints during training?¤

A: Checkpoints are saved automatically by the Trainer. Configure frequency:

training_config = TrainingConfiguration(
    num_epochs=100,
    checkpoint_every=1000,  # Save every 1000 steps
)

Data¤

Q: What data formats does Workshop support?¤

A: Workshop supports:

Images (PNG, JPG, NumPy arrays)
Text (tokenized sequences)
Audio (waveforms, spectrograms)
Multi-modal (combined modalities)

See Data Guide.

Q: How do I use my own dataset?¤

A: Create a custom dataset:

from workshop.generative_models.modalities.base import BaseDataset

class MyDataset(BaseDataset):
    def __len__(self):
        return len(self.data)

    def __iter__(self):
        for sample in self.data:
            yield {"data": sample}

See Custom Datasets.

Q: Can I use data augmentation?¤

A: Yes! Workshop provides augmentation for all modalities:

@jax.jit
def augment(batch, key):
    # Apply augmentation
    batch = random_flip(batch, key)
    batch = random_crop(batch, key)
    return batch

Technical¤

Q: Why JAX instead of PyTorch/TensorFlow?¤

A: JAX offers:

Automatic differentiation
JIT compilation for speed
Functional programming style
Easy parallelization with vmap/pmap
Better for research and experimentation

Q: Why Flax NNX specifically?¤

A: Flax NNX is:

Modern and actively developed
More Pythonic than Linen
Better for complex architectures
Easier state management
Official Flax direction

Q: Can I mix PyTorch/TensorFlow with Workshop?¤

A: No. Workshop uses JAX/Flax NNX exclusively. However, you can:

Convert PyTorch/TensorFlow weights to JAX
Use Workshop for training, export for PyTorch inference
Integrate at the data loading level

Q: How do I debug JAX code?¤

A: Debugging tips:

Disable JIT: Run without @jax.jit decorator
Print values: Use jax.debug.print()
Check shapes: Print intermediate shapes
Use assertions: Add shape checks
Breakpoints: Use jax.debug.breakpoint()

See JAX debugging docs for more details.

Performance¤

Q: How much memory do I need?¤

A: Depends on model size and batch size:

Small models (VAE): 2-4GB GPU
Medium models (StyleGAN): 8-16GB GPU
Large models (large diffusion): 24-40GB GPU

Use gradient accumulation for larger models.

Q: How many GPUs do I need?¤

A: One GPU is sufficient for most tasks. Multiple GPUs help with:

Larger batch sizes (data parallelism)
Larger models (model parallelism)
Faster training (distributed training)

See Distributed Training.

Q: Can I train on CPU?¤

A: Yes, but it's slow. Recommended for:

Testing and debugging
Small models
Limited datasets

Not recommended for production training.

Deployment¤

Q: How do I deploy a trained model?¤

A: Several options:

Export model: Save with checkpointing
Create REST API: Use Flask/FastAPI
Containerize: Use Docker
Cloud deployment: Deploy to cloud platforms

See Deployment Guide.

Q: Can I convert models to ONNX?¤

A: JAX models can be converted to ONNX for deployment, but it requires additional tools. See integration guides for details.

Q: How do I optimize for inference?¤

A: Optimization techniques:

JIT compilation: Automatic with JAX
Batching: Process multiple samples together
Mixed precision: Use bfloat16
Model pruning: Remove unnecessary parameters

See Optimization.

Troubleshooting¤

Q: I get "Out of Memory" errors¤

A: Solutions:

Reduce batch size: Smaller batches use less memory
Gradient accumulation: Simulate larger batches
Gradient checkpointing: Trade compute for memory
Mixed precision: Use bfloat16
Model parallelism: Split model across devices

Q: Tests are failing¤

A: Common issues:

Missing dependencies: Run uv sync --all-extras
CUDA not available: Some tests require GPU
Outdated code: Pull latest changes
Environment issues: Create fresh virtual environment

Q: Import errors¤

A: Check:

Installation: uv pip list | grep workshop
Python path: Verify workshop is installed
Virtual environment: Activate correct environment
Dependencies: Install with uv sync

Contributing¤

Q: How can I contribute?¤

A: Many ways to contribute:

Report bugs
Suggest features
Submit pull requests
Improve documentation
Help others in discussions

See Contributing Guide.

Q: I found a bug. What should I do?¤

A: Please open a GitHub issue with:

Description of the bug
Steps to reproduce
Expected vs actual behavior
System information (OS, Python version, etc.)

Q: Can I request a feature?¤

A: Yes! Open a GitHub issue describing:

Feature description
Use case
Why it's useful
Possible implementation ideas

Getting Help¤

Q: Where can I get help?¤

A: Several resources:

Documentation: Start here
GitHub Issues: For bugs and features
GitHub Discussions: For questions
Examples: Check examples directory

Q: Documentation is unclear¤

A: Please let us know! Open an issue describing:

Which page is unclear
What's confusing
What would help

We appreciate feedback to improve docs.

License¤

Q: What's the license?¤

A: Workshop is open source under the MIT License. You can:

Use commercially
Modify the code
Distribute
Use privately

See LICENSE file for details.

Q: Can I use Workshop in my company?¤

A: Yes! The MIT License allows commercial use.