Verified Information • Last Updated Mar 2026

Multimodal Generative AI: Vision, Speech, and Assistants

We are introducing a new course to replace the "Coding with ChatGPT" course in the Generative AI specialization. This updated course will cover materials, models, and content released in 2024. New additions include material on using AI for image-to-text (vision), text-to-speech, speech-to-text, and the Assistants API. All these topics come with new labs, lessons, and exercises.
Duration: 6 Months
Institution: Codio
Format: Online

Eligibility Criteria

Academic Foundation

A recognized Bachelor’s degree or high school equivalent is required for admission to Codio.

Language Proficiency

English proficiency is required. IELTS, TOEFL, or standard medium-of-instruction certificates are accepted.

Detailed Fees Breakdown

Base Tuition Fee: $117
Total Est. Investment: $117

Scholarships and early-bird waivers may apply. Contact admissions for exact institutional fees.

Academic Trajectory

Program Outcome

Graduates of the Multimodal Generative AI: Vision, Speech, and Assistants program at Codio are equipped with global perspectives, ready to excel in international markets and top-tier career opportunities.
