Question 2: Hallucinations in LLMs: Identification and Mitigation
Comparative Study Across LLMs: Select any two publicly available LLMs (e.g., GPT, LLaMA, BLOOM, Claude) and compare their hallucination patterns in three distinct domains: history, technology/science, and medicine.
Design complex, multi-part prompts for each domain (total prompts). The prompts must challenge the model with facts, reasoning, and synthesis, probing areas where hallucinations are likely to occur.
Identify at least three types of hallucinations:
a) Factual Hallucinations: Incorrect information.
b) Logical Hallucinations: Errors in reasoning.
c) Contradictory Hallucinations: Instances where the model contradicts itself within the same response or across multiple responses.
Quantify the frequency of each type of hallucination across both models and domains. Develop a hallucination taxonomy to categorize and understand the variations in hallucination behavior.
Novel Mitigation Strategy
Based on your findings, propose a novel method for hallucination mitigation. Your method must include:
A novel prompt design or external augmentation strategy (e.g., introducing external fact verification, reasoning chains, or context-specific fine-tuning).
Justify the effectiveness of your proposed method and compare it with two existing mitigation approaches, and cite them.
How would you quantitatively measure the generated output for the hallucination?
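As one hedged answer to the measurement question, a response can be split into individual claims, each claim checked against a trusted reference, and the share of unsupported claims reported. The function name `hallucination_rate` and the boolean labeling scheme below are assumptions made for illustration; the claim extraction and verification steps themselves are left to the annotator.

```python
def hallucination_rate(claim_labels):
    """Return the fraction of claims judged unsupported.

    claim_labels: list of booleans, one per extracted claim, where
    True means the claim was verified against a reference source.
    """
    if not claim_labels:
        raise ValueError("no claims to score")
    unsupported = sum(1 for verified in claim_labels if not verified)
    return unsupported / len(claim_labels)

# Example: four claims, two of which could not be verified.
example_rate = hallucination_rate([True, False, True, False])
```

The same score can be computed before and after applying a mitigation method, so the difference gives a simple quantitative comparison.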
Bonus
Hallucination Detection Framework: Propose a lightweight hallucination detection framework that could be incorporated into LLM deployment pipelines. This framework should work in real time to flag potentially hallucinated responses based on the patterns identified in your study.
LLM-Assisted Evaluation: Use PerplexityLLM for the same prompts and check the alignment with the output of your model.
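A very lightweight alignment check, usable both for comparing against a second LLM and as a real-time flag, is to measure word overlap between two responses to the same prompt and flag low agreement for review. This is a minimal sketch under stated assumptions: the helper names, the Jaccard measure, and the 0.5 threshold are illustrative choices, and lexical overlap is only a crude proxy for factual agreement.

```python
import re

def tokens(text):
    """Lowercase the text and return its set of alphanumeric word tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def jaccard(a, b):
    """Jaccard similarity between the token sets of two responses."""
    ta, tb = tokens(a), tokens(b)
    if not ta and not tb:
        return 1.0  # two empty responses are treated as identical
    return len(ta & tb) / len(ta | tb)

def flag_response(response, reference, threshold=0.5):
    """Return True when agreement falls below the (assumed) threshold."""
    return jaccard(response, reference) < threshold
```

For example, `flag_response(model_output, second_llm_output)` would mark a pair for manual review when the two answers share few words. An embedding-based or entailment-based comparison could replace `jaccard` without changing the surrounding logic.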
