← Back to Evaluate
Advanced check — AI Pilot

Evaluate this AI response — all five criteria

Rate the response below across all five evaluation criteria without guidance. Then identify which specific claims require a citation but don't have one, and reflect on how the response could be improved.


The scenario
Student scenario
Abel, 1st-year student
Abel is writing a study advice essay and asks ChatGPT: "What does research say about the effects of sleep deprivation before exams on memory? I need this for my study advice essay."
The AI response
ChatGPT response
Simulated output

Research clearly shows that sleeping the night before an exam is significantly more effective than late-night cramming. A landmark 2021 study from the Sleep Research Institute at Cambridge University followed 840 students across twelve universities and found that those who slept eight hours before an exam scored 34% higher than students who instead spent those hours reviewing material. This effect held across all subject areas tested.

The neuroscience behind this is well understood. During sleep, the hippocampus transfers encoded memories to the neocortex for long-term storage — a process called memory consolidation. Cramming disrupts this by elevating cortisol levels, which directly impair hippocampal function. Matthew Walker (2017) explains in Why We Sleep that prolonged sleep deprivation causes permanent neuronal damage that cannot be reversed by later recovery sleep.

For your essay, the case is clear: any student who sacrifices sleep for extra study time is actively harming their academic results. All peer-reviewed evidence supports this conclusion, and there are no credible contrary findings.

Factual accuracy concern
Source transparency concern
Critical depth concern

How would you improve this response?
Now that you have identified the problems, consider how you would rewrite or revise the AI response for use in an academic essay. Write your revision or a description of the changes you would make (aim for at least 60 words), then reveal the model answer.
0 words — aim for at least 60
Model evaluation summary
Factual accuracy: 2/5
Source transparency: 2/5
Task relevance: 4/5
Coherence: 4/5
Critical depth: 1/5
Key problems: The "Sleep Research Institute at Cambridge" and the "34% higher" statistic appear fabricated — this institution cannot be located and the figure lacks any verifiable source. The claim that sleep deprivation causes "permanent neuronal damage that cannot be reversed" significantly overstates Walker's (2017) argument and the peer-reviewed literature. The third paragraph's absolute assertion that "all peer-reviewed evidence supports this" and "there are no credible contrary findings" is false: effect sizes in sleep research are frequently debated, Walker's book has been publicly criticised by sleep researchers, and effects vary by task type and deprivation duration.

What works: The task is addressed directly. The neuroscience of memory consolidation in paragraph two is broadly accurate. The logical structure is clear.

To use this response responsibly: Remove the Cambridge statistic entirely or replace it with a real, verifiable study. Replace Walker (2017) with peer-reviewed sources (e.g., Stickgold, 2005, Nature Reviews Neuroscience; Killgore, 2010, Progress in Brain Research). Rewrite the conclusion to reflect the actual state of evidence, including conditions and debates.
Example of an improved version

Research consistently shows that sleep plays a critical role in memory consolidation. During sleep, the hippocampus replays encoded experiences and transfers them to the neocortex for long-term storage (Stickgold, 2005, Nature Reviews Neuroscience). Sleep deprivation disrupts this process, with studies linking it to impaired declarative memory performance (Killgore, 2010, Progress in Brain Research).

However, the relationship between sleep deprivation and exam performance is more nuanced than it may appear. Effects depend on the type of memory task, the duration of deprivation, and individual differences. Walker's (2017) popular account in Why We Sleep has been criticised in the academic literature for overstating the permanence and severity of sleep deprivation effects (Frankild et al., 2020). Some evidence also suggests that strategic massed studying can benefit certain types of recognition memory in the short term.

For a study advice essay, the evidence suggests that prioritising sleep is generally beneficial for retention — but this advice is most applicable to extended periods of study, not as a universal rule that overrides all other considerations.

/ 5
Back to Evaluate Continue to Ethics ›

The AI response shown is a simulated example for learning purposes. The Sleep Research Institute at Cambridge and the 34% statistic are fictitious. Stickgold (2005) and Killgore (2010) are real sources.