- Load and initialize Replicate
-
Download Images and Load Images locally
- Provide various prompts to test different Multi Modal LLMs
- Generate Image Reasoning from different LLMs with different prompts for different images
- Display Sampled Responses from Multi-Modal LLMs
- Human Label the Correctness and Relevance of the Multi-Modal LLM Reasoning Results
- Summary of preliminary findings with evaluated Multi-Modal Models
-
Replicate Stream Complete, Async Complete, Async Stream Complete Mode