Course outline
Duration : 1 Day |
© AFI Expertise inc. | |
This one‑day course focuses on building intelligent applications that can see, interpret, and reason over images and documents using various multimodal models and agent‑based tools. Learners explore how visual and document inputs can be combined with language models to enable structured data extraction, analysis, and decision‑making workflows. The course emphasizes practical approaches to information extraction, tool orchestration, and grounding model responses in visual data. | |
Audience | This course is intended for developers, AI engineers, and technical professionals who want to design and deploy Azure applications that leverage images and documents using multimodal models and Azure AI services. It is suitable for learners with basic programming skills and a general understanding of cloud or AI concepts. |
Prerequisites |
|
Objectives | By the end of this course, participants will be able to:
|
Teaching method | Instructor‑led training delivered by a Microsoft Certified Trainer. |
Contents | Module 1 – Develop a Vision‑Enabled Generative AI Application
Module 2 – Generate Images with AI
Module 3 – Generate Videos with Microsoft Foundry
Module 4 – Analyze Images with Content Understanding
Module 5 – Create a Multimodal Analysis Solution with Azure Content Understanding
Module 6 – Create an Azure Content Understanding Client Application
Module 7 – Extract Data with Azure Document Intelligence
Module 8 – Create a Knowledge Mining Solution with Azure AI Search
|