# Encoder Model

This directory contains the pre-trained encoder model for voice conversion.

## Model Details
- **File**: `encoder.pt`
- **Size**: ~17.1 MB
- **Input**: Audio waveform
- **Output**: Speaker embeddings

## Usage
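```python
import torch

# Load the encoder model.
# Assumes encoder.pt stores the full pickled module, not just a state dict;
# recent PyTorch releases need weights_only=False to unpickle a full module,
# so only load files you trust.
encoder = torch.load('encoder.pt', weights_only=False)
encoder.eval()

# Process audio: audio_tensor is a torch.Tensor holding the input waveform
with torch.no_grad():
    embedding = encoder(audio_tensor)
```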
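
The snippet above expects an `audio_tensor` but does not show how to build one. The following is a minimal sketch of one way to prepare a waveform with NumPy. The helper name `prepare_waveform`, the peak normalization, and the `(1, num_samples)` mono layout are assumptions for illustration, not something this model is confirmed to require; check `config.json` for the actual input format.

```python
import numpy as np


def prepare_waveform(wav: np.ndarray) -> np.ndarray:
    """Convert a waveform to mono float32, peak-normalize it, add a batch axis.

    Hypothetical helper: the input layout the encoder expects is an assumption.
    """
    wav = np.asarray(wav, dtype=np.float32)
    if wav.ndim == 2:
        # Assumes (channels, samples), the layout librosa.load(..., mono=False) returns.
        wav = wav.mean(axis=0)
    # Scale so the loudest sample has magnitude 1; leave silence untouched.
    peak = np.max(np.abs(wav)) if wav.size else 0.0
    if peak > 0:
        wav = wav / peak
    # Add a leading batch dimension: (num_samples,) -> (1, num_samples).
    return wav[np.newaxis, :]
```

A waveform read with `librosa.load` can be passed through this helper and then converted with `torch.from_numpy` to obtain `audio_tensor`.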

## Dependencies
- PyTorch
- NumPy
- Librosa (for audio processing)

## Model Configuration
See `config.json` for model architecture and training parameters.
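
To inspect the configuration without assuming its layout, a small sketch like the one below may help. The helper names `load_config` and `describe_config` are hypothetical, and the code makes no assumption about which keys `config.json` contains beyond it being a JSON object.

```python
import json
from pathlib import Path


def load_config(path="config.json"):
    """Read the model configuration file into a plain dict."""
    with Path(path).open(encoding="utf-8") as f:
        return json.load(f)


def describe_config(config):
    """Return one 'key: value' line per top-level entry, sorted by key."""
    return [f"{key}: {config[key]!r}" for key in sorted(config)]
```

Calling `print("\n".join(describe_config(load_config())))` from this directory lists the top-level settings.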