The "To Keep Up" Wiki

A collection of information we find useful


washpost20250604

Differences

This shows you the differences between two versions of the page.

washpost20250604 [2025.06.07 11:16] – created Steve Isenberg
washpost20250604 [2025.06.07 11:30] (current) Steve Isenberg
Line 2: Line 2:
  
 =====Summary of Washington Post 6/4/2025 Article=====
-[[https://www.washingtonpost.com/technology/2025/06/04/ai-summarizers-analysis-test-documents-books/|the article]]
+Information summarized from [[https://www.washingtonpost.com/technology/2025/06/04/ai-summarizers-analysis-test-documents-books/|The Washington Post article]]
  
 The Washington Post “challenged AI helpers to decode legal contracts, simplify medical research, speed-read a novel, and make sense of Trump speeches.”  They asked ChatGPT, Claude, Copilot, Meta AI, and Gemini.  Responses varied from good to bad.  Scores are out of 10.
Line 14: Line 14:
  
 ====Law====
 +Understanding two common legal contracts.
   * [6.9] Claude. Most consistently decent answers and did well suggesting changes to their test rental agreement.
   * [6.1] Gemini
Line 20: Line 21:
   * [2.6] Meta AI. Tried to reduce complex parts of the contracts to one-line summaries. Skipped several sections and important points.
  
 +====Health Science====
 +Analyzing scientific research.
  
 +  * [7.7] Claude. Good summary of a paper on long COVID; scored low on another paper when accounting for racial differences.
 +  * [7.2] ChatGPT
 +  * [7.0] Copilot
 +  * [6.5] Gemini. Left out key descriptions of the research on Parkinson's disease and why it mattered.
 +  * [6.0] Meta AI
 +
 +====Politics====
 +Analyzing Trump's speeches.
 +
 +  * [7.2] ChatGPT. Impressive responses to half of the questions posed to it; accurately fact-checked Trump's claims about winning the 2020 election.
 +  * [6.2] Claude
 +  * [5.2] Meta AI. Said Trump never stated the number of jobs returning to Michigan, and highlighted what Trump said about auto jobs.
 +  * [5.0] Gemini
 +  * [3.7] Copilot. Incorrect on the number of jobs returning to Michigan. Didn't capture the charged nature of Trump's speech.
 +
 +====Overall Winner====
 +According to The Washington Post:
 +
 +  * [69.9] Claude, the only model that never hallucinated.
 +  * [68.4] ChatGPT
 +  * [49.7] Gemini
 +  * [49.0] Copilot
 +  * [45.0] Meta AI
  
washpost20250604.txt · Last modified: by Steve Isenberg