Get in Touch

Course Outline

Hunyuan Multimodal Foundations and Lab Setup

  • Understanding Hunyuan's multimodal capabilities for image, 3D, and video use cases.
  • Identifying practical business scenarios for creative, product, and content teams.
  • Preparing the lab environment, sample assets, and model access.
  • Executing initial generation tasks and reviewing outputs.

Prompt Design and Workflow Patterns

  • Structuring prompts to achieve consistent multimodal results.
  • Working with text prompts, reference images, and basic input settings.
  • Selecting appropriate workflows for image, video, or 3D generation.
  • Iterating on prompts based on output quality and business objectives.

Image Generation and Review Labs

  • Creating marketing, product, and concept images from prompts.
  • Refining visual style, composition, and content consistency.
  • Reviewing outputs for utility, quality, and brand alignment.
  • Organizing image outputs for approval and downstream use.

Video Generation Labs

  • Generating short video outputs from prompts and prepared inputs.
  • Controlling style, scene intent, and output variation.
  • Reviewing videos for clarity, continuity, and practical applicability.
  • Preparing video outputs for demonstrations or content workflows.

3D Asset Creation Labs

  • Generating basic 3D assets from text or image inputs.
  • Assessing geometry, texture quality, and asset usability.
  • Exporting assets for visualization, prototyping, or content pipelines.
  • Evaluating when 3D generation is preferable to image or video workflows.

Integration, Governance, and Next Steps

  • Delivering generated assets via simple applications, services, or APIs.
  • Connecting multimodal outputs to product, content, and review workflows.
  • Implementing practical checks for quality, brand safety, copyright, and responsible use.
  • Planning pilot use cases and next steps for internal adoption.

Requirements

  • Fundamental understanding of AI and generative AI concepts.
  • Experience using web applications, APIs, or common developer tools.
  • Basic proficiency in Python or scripting.

Audience

  • Developers creating AI-enhanced product features.
  • Technical product managers and solution architects.
  • Innovation, media, and digital teams working with image, video, or 3D content.
 14 Hours

Number of participants


Price per participant

Upcoming Courses

Related Categories