	# Encoder Model

	This directory contains the pre-trained encoder model for voice conversion.

	## Model Details
	- File: `encoder.pt`
	- Size: ~17.1 MB
	- Input: Audio waveform
	- Output: Speaker embeddings

	## Usage
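	```python
	import torch

	# Load the encoder model. This assumes `encoder.pt` stores the full pickled
	# module rather than a state_dict; PyTorch 2.6+ then needs weights_only=False.
	# Only unpickle checkpoints from a source you trust.
	encoder = torch.load('encoder.pt', map_location='cpu', weights_only=False)
	encoder.eval()

	# Process audio (`audio_tensor` is a waveform tensor you have prepared)
	with torch.no_grad():
	    embedding = encoder(audio_tensor)
	```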
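
The usage snippet expects an `audio_tensor` but does not say how to build one. The sketch below shows one possible way to turn a NumPy waveform into a batch-shaped array. The helper name `prepare_waveform`, the peak normalization, and the `(1, num_samples)` layout are assumptions for illustration, not something this repository specifies; check `config.json` for the sample rate and input format the model actually expects.

```python
import numpy as np


def prepare_waveform(wav: np.ndarray) -> np.ndarray:
    """Return a float32, peak-normalized mono waveform shaped (1, num_samples).

    Hypothetical helper: the normalization and shape are assumptions,
    not a documented requirement of encoder.pt.
    """
    wav = np.asarray(wav, dtype=np.float32)
    if wav.ndim == 2:
        # Assumes channels-first input (channels, samples); average to mono.
        wav = wav.mean(axis=0)
    peak = np.max(np.abs(wav)) if wav.size else 0.0
    if peak > 0:
        # Scale so the largest absolute sample is 1.0.
        wav = wav / peak
    # Add a leading batch dimension.
    return wav[np.newaxis, :]
```

With Librosa, a waveform could be read with `librosa.load(path, sr=16000, mono=True)` (the 16 kHz rate is a placeholder) and converted with `torch.from_numpy(prepare_waveform(wav))` before being passed to the encoder.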

	## Dependencies
	- PyTorch
	- NumPy
	- Librosa (for audio processing)
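
Before loading the model, it can help to confirm that the packages above are importable. This is a minimal sketch using only the standard library; `missing_dependencies` is a hypothetical helper name, and the default tuple simply lists the import names of the three dependencies.

```python
from importlib.util import find_spec


def missing_dependencies(modules=("torch", "numpy", "librosa")):
    """Return the names from `modules` that cannot be found for import."""
    return [name for name in modules if find_spec(name) is None]


# Example: report anything that still needs to be installed.
missing = missing_dependencies()
if missing:
    print("Missing packages:", ", ".join(missing))
```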

	## Model Configuration
	See `config.json` for model architecture and training parameters.
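
To inspect the configuration without guessing at its contents, the file can be read with the standard `json` module. The sketch below assumes only that `config.json` holds a JSON object at the top level; `load_config` and `summarize` are hypothetical helper names, and no particular keys are assumed.

```python
import json
from pathlib import Path


def load_config(path="config.json"):
    """Read the JSON config file and return it as a dict."""
    with Path(path).open(encoding="utf-8") as f:
        return json.load(f)


def summarize(config, indent=0):
    """Return one line per key, with nested objects indented by two spaces."""
    lines = []
    for key, value in config.items():
        if isinstance(value, dict):
            lines.append(" " * indent + f"{key}:")
            lines.extend(summarize(value, indent + 2))
        else:
            lines.append(" " * indent + f"{key}: {value!r}")
    return lines
```

Calling `print("\n".join(summarize(load_config())))` from this directory would list whatever architecture and training parameters the file contains.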