# Encoder Model
This directory contains the pre-trained encoder model for voice conversion.
## Model Details
- **File**: `encoder.pt`
- **Size**: ~17.1 MB
- **Input**: Audio waveform
- **Output**: Speaker embeddings
## Usage
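```python
import torch

# Load the encoder model (this assumes encoder.pt stores the whole pickled module;
# on PyTorch 2.6+ that also requires passing weights_only=False to torch.load)
encoder = torch.load('encoder.pt', map_location='cpu')
encoder.eval()

# Process audio: audio_tensor is a waveform tensor prepared by the caller
with torch.no_grad():
    embedding = encoder(audio_tensor)
```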
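The snippet above takes `audio_tensor` as given. As a minimal sketch of how a waveform might be prepared with NumPy, the helper below converts samples to mono `float32`, peak-normalizes them, and adds a batch dimension. The function name `prepare_waveform`, the channels-first layout, and the `(1, num_samples)` output shape are assumptions for illustration; this README does not state the input shape, sample rate, or normalization the encoder expects, so check `config.json` before relying on it.

```python
import numpy as np


def prepare_waveform(samples, peak=1.0):
    """Return a (1, num_samples) float32 array: mono and peak-normalized.

    Hypothetical helper; the layout the encoder expects is an assumption.
    """
    wav = np.asarray(samples, dtype=np.float32)
    if wav.ndim == 2:
        # Assumes channels-first audio (channels, samples); average to mono.
        wav = wav.mean(axis=0)
    max_abs = float(np.max(np.abs(wav))) if wav.size else 0.0
    if max_abs > 0.0:
        # Scale so the loudest sample has magnitude `peak`.
        wav = wav * (peak / max_abs)
    # Add a leading batch dimension and make the dtype explicit.
    return wav[np.newaxis, :].astype(np.float32)
```

The result can be turned into a tensor with `torch.from_numpy(prepare_waveform(samples))`, where `samples` could come from `librosa.load`, which returns a mono `float32` array by default.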
## Dependencies
- PyTorch
- NumPy
- Librosa (for audio processing)
## Model Configuration
See `config.json` for the model architecture and training parameters.
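Since the schema of `config.json` is not documented here, a reasonable first step is to load it and list its top-level keys. The sketch below uses only the standard library; the helper names `load_config` and `summarize_config` are hypothetical, and it assumes the file holds a JSON object at the top level.

```python
import json
from pathlib import Path


def load_config(path="config.json"):
    """Read a JSON config file and return its parsed contents."""
    with Path(path).open(encoding="utf-8") as f:
        return json.load(f)


def summarize_config(config):
    """Map each top-level key to the type name of its value.

    Assumes `config` is a dict (a JSON object at the top level).
    """
    return {key: type(value).__name__ for key, value in config.items()}


if __name__ == "__main__":
    # Print an overview without assuming any particular keys exist.
    for key, type_name in summarize_config(load_config()).items():
        print(f"{key}: {type_name}")
```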