This course covers how to use AI models for image-to-text (vision), text-to-speech, and speech-to-text tasks through the latest APIs. It is part of the 'Getting Started with Generative AI API Specialization'.
Explore more courses and learning paths related to Multimodal Generative AI: Vision, Speech, and Assistants.
Provided by Coursera
Pricing: Free to audit. Certificate: $49-79. Coursera Plus: $59/month.