Differences

This shows you the differences between two versions of the page.

--- washpost20250604 [2025.06.07 11:16] – created Steve Isenberg
+++ washpost20250604 [2025.06.07 11:30] (current) – Steve Isenberg
@@ Line 2: / Line 2: @@
 =====Summary of Washington Post 6/4/2025 Article=====
-[[https://www.washingtonpost.com/technology/2025/06/04/ai-summarizers-analysis-test-documents-books/|the article]]
+Information summarized from [[https://www.washingtonpost.com/technology/2025/06/04/ai-summarizers-analysis-test-documents-books/|The Washington Post article]]
 Washington Post ”challenged AI helpers to decode legal contracts, simplify medical research, speed-read a novel, and make sense of Trump speeches.”  They asked ChatGPT, Claude, Copilot, Meta AI, and Gemini.  Responses varied from good to bad.  Scores are out of 10.
@@ Line 14: / Line 14: @@
 ====Law====
+Understanding two common legal contracts.
   * [6.9] Claude. Most consistently decent answers and did well suggesting changes to their test rental agreement.
   * [6.1] Gemini
@@ Line 20: / Line 21: @@
   * [2.6] Meta AI. tried to reduce complex parts of the contracts to one-line summaries. Skipped several sections and important points.
+====Health Science====
+Analyzing scientific research.
+  * [7.7] Claude. Good summary on paper on Long Covid; scored low on another paper when accounting for racial differences
+  * [7.2] ChatGPT
+  * [7.0] Copilot
+  * [6.5] Gemini. Left out key descriptions of the research on Parkinson's disease and why it mattered.
+  * [6.0] Meta AI
+====Politics====
+Analyzing Trump's speeches.
+  * [7.2] ChatGPT. Impressive responses to half of questions posed to it; accurate fact-checking Trump's claims about winning 2020 election.
+  * [6.2] Claude
+  * [5.2] Meta AI. Said Trump never said # jobs returning to MI and highlighted what Trump said about auto jobs.
+  * [5.0] Gemini
+  * [3.7] Copilot. Incorrect on # jobs returning to MI. Didn't capture charged nature of Trump's speech.
+====Overall Winner====
+This according to The Washington Post
+  * [69.9] Claude - which was the only model that never hallucinated.
+  * [68.4] ChatGPT
+  * [49.7] Gemini
+  * [49.0] Copilot
+  * [45.0] Meta AI