How to introduce multimodal large models to enrich your product experience

Keynote

Abstract

With the emergence of large model technology, developers are beginning to use large models to develop their applications. However, in the actual development process, the vast majority of developers are more proficient in using text-based models for application development, and are not as skilled in using multi-modal large models to develop applications. In this sharing session, we will introduce some techniques and methods for developing applications based on multi-modal large models, providing explanations, usage instructions, cases, and tips for integrating common multi-modal large models. This will help developers understand the use of multi-modal large models, so they can utilize them in their own work scenarios.

Details

The content includes:

  1. The modalities currently supported by the model;
  2. Application scenarios for different modalities;
    1. Images
    2. Audio
    3. Video
  3. Usage and tips for the image modality
  4. Usage and tips for the audio modality
  5. Usage and tips for the video modality