- Download Image Locally
- Initialize Pydantic Class for Restaurant
- Load OpenAI GPT4V Multi-Modal LLM Model
- Plot the image
- Using Multi-Modal Pydantic Program to generate structured data from GPT4V Output for Restaurant Image
-
Test Pydantic for MiniGPT-4, Fuyu-8B, LLaVa-13B, CogVLM models
-
Change to Amazon Product Example
-
Initialize the Amazon Product Pydantic Class
- Using Multi-Modal Pydantic Program to generate structured data from GPT4V Output for Amazon Product Image
-
Test Pydantic for MiniGPT-4, Fuyu-8B, LLaVa-13B, CogVLM models
- Initialize the Instagram Ads Pydantic Class and compare performance of different Multi-Modal LLMs