- Download Data
- Image1 - Safety violation percentage of different LLMs across categories.
- Image2 - Performance of Llama2 vs. Mistral across various NLP tasks.
- Chain-of-thought prompting.
- Image3 - Performance of different LLMs across various NLP tasks.
- Chain-of-thought prompting.
- Final Observations: