Generate text and analyze images using Gemini AI models
Convert input audio with a Persian voice model
Transform Persian audio using a model and index