Configuration Input
Paste YAML or JSON configuration
Calculate Memory from Config
Manual Configuration
Model Architecture
Hidden Size
Number of Layers
Vocabulary Size
Intermediate Size
Number of Attention Heads
Number of Key Value Heads
Tie Word Embeddings
Training Configuration
Sequence Length
Micro Batch Size
Gradient Accumulation Steps
Parallelism
Tensor Parallelism
Pipeline Parallelism
Data Parallelism
ZeRO Stage (0, 1, 2, or 3)
Full Activation Checkpointing
Calculate Memory (Manual Input)
OOM Prediction
Memory Component Breakdown
Aggregate Memory Metrics
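
The manual inputs listed above can be related to a rough per-GPU estimate of model-state memory. The following is a minimal sketch, not the calculator's actual method. It assumes a Llama-style decoder (gated MLP with three projections, grouped-query attention, no biases), mixed-precision Adam training at 16 bytes per parameter (2 for half-precision weights, 2 for gradients, 12 for fp32 optimizer states), and an even split of parameters across tensor and pipeline ranks. The names `ManualConfig`, `count_parameters`, and `model_state_bytes` are hypothetical. Activation memory, which depends on Sequence Length, Micro Batch Size, and Full Activation Checkpointing, is not modeled here.

```python
from dataclasses import dataclass


@dataclass
class ManualConfig:
    # Hypothetical container mirroring the manual input fields.
    hidden_size: int
    num_layers: int
    vocab_size: int
    intermediate_size: int
    num_attention_heads: int
    num_key_value_heads: int
    tie_word_embeddings: bool
    tensor_parallel: int = 1
    pipeline_parallel: int = 1
    data_parallel: int = 1
    zero_stage: int = 0


def count_parameters(cfg: ManualConfig) -> int:
    """Parameter count under the assumed Llama-style layout."""
    head_dim = cfg.hidden_size // cfg.num_attention_heads
    kv_dim = head_dim * cfg.num_key_value_heads
    # Q and O projections are hidden x hidden; K and V are hidden x kv_dim.
    attention = 2 * cfg.hidden_size * cfg.hidden_size + 2 * cfg.hidden_size * kv_dim
    # Assumed gated MLP: gate, up, and down projections.
    mlp = 3 * cfg.hidden_size * cfg.intermediate_size
    # Two norm weight vectors per layer.
    norms = 2 * cfg.hidden_size
    embeddings = cfg.vocab_size * cfg.hidden_size
    if not cfg.tie_word_embeddings:
        embeddings *= 2  # separate output projection
    final_norm = cfg.hidden_size
    return cfg.num_layers * (attention + mlp + norms) + embeddings + final_norm


def model_state_bytes(cfg: ManualConfig) -> dict:
    """Per-GPU bytes for weights, gradients, and optimizer states."""
    if cfg.zero_stage not in (0, 1, 2, 3):
        raise ValueError("zero_stage must be 0, 1, 2, or 3")
    # Assumption: parameters divide evenly across tensor and pipeline ranks.
    local_params = count_parameters(cfg) / (cfg.tensor_parallel * cfg.pipeline_parallel)
    weights = 2 * local_params
    gradients = 2 * local_params
    optimizer = 12 * local_params
    # ZeRO partitions progressively more state across data-parallel ranks.
    if cfg.zero_stage >= 1:
        optimizer /= cfg.data_parallel
    if cfg.zero_stage >= 2:
        gradients /= cfg.data_parallel
    if cfg.zero_stage >= 3:
        weights /= cfg.data_parallel
    return {
        "weights": weights,
        "gradients": gradients,
        "optimizer": optimizer,
        "total": weights + gradients + optimizer,
    }


if __name__ == "__main__":
    cfg = ManualConfig(
        hidden_size=4096, num_layers=32, vocab_size=32000,
        intermediate_size=11008, num_attention_heads=32,
        num_key_value_heads=32, tie_word_embeddings=False,
        data_parallel=8, zero_stage=1,
    )
    breakdown = model_state_bytes(cfg)
    for name, value in breakdown.items():
        print(f"{name}: {value / 2**30:.2f} GiB")
```

Comparing the `total` value against the device's memory capacity gives only a lower bound for an OOM check, since activations, temporary buffers, and framework overhead come on top of the model states.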