washpost20250604
Differences
This shows you the differences between two versions of the page.
washpost20250604 [2025.06.07 11:16] – created Steve Isenberg | washpost20250604 [2025.06.07 11:30] (current) – Steve Isenberg | ||
---|---|---|---|
Line 2: | Line 2: | ||
=====Summary of Washington Post 6/4/2025 Article===== | =====Summary of Washington Post 6/4/2025 Article===== | ||
- | [[https:// | + | Information summarized from [[https:// |
Washington Post ” | Washington Post ” | ||
Line 14: | Line 14: | ||
====Law==== | ====Law==== | ||
+ | Understanding two common legal contracts. | ||
* [6.9] Claude. Most consistently decent answers and did well suggesting changes to their test rental agreement. | * [6.9] Claude. Most consistently decent answers and did well suggesting changes to their test rental agreement. | ||
* [6.1] Gemini | * [6.1] Gemini | ||
Line 20: | Line 21: | ||
* [2.6] Meta AI. tried to reduce complex parts of the contracts to one-line summaries. Skipped several sections and important points. | * [2.6] Meta AI. tried to reduce complex parts of the contracts to one-line summaries. Skipped several sections and important points. | ||
+ | ====Health Science==== | ||
+ | Analyzing scientific research. | ||
+ | * [7.7] Claude. Good summary on paper on Long Covid; scored low on another paper when accounting for racial differences | ||
+ | * [7.2] ChatGPT | ||
+ | * [7.0] Copilot | ||
+ | * [6.5] Gemini. Left out key descriptions of the research on Parkinson' | ||
+ | * [6.0] Meta AI | ||
+ | |||
+ | ====Politics==== | ||
+ | Analyzing Trump' | ||
+ | |||
+ | * [7.2] ChatGPT. Impressive responses to half of questions posed to it; accurate fact-checking Trump' | ||
+ | * [6.2] Claude | ||
+ | * [5.2] Meta AI. Said Trump never said # jobs returning to MI and highlighted what Trump said about auto jobs. | ||
+ | * [5.0] Gemini | ||
+ | * [3.7] Copilot. Incorrect on # jobs returning to MI. Didn't capture charged nature of Trump' | ||
+ | |||
+ | ====Overall Winner==== | ||
+ | This according to The Washington Post | ||
+ | |||
+ | * [69.9] Claude - which was the only model that never hallucinated. | ||
+ | * [68.4] ChatGPT | ||
+ | * [49.7] Gemini | ||
+ | * [49.0] Copilot | ||
+ | * [45.0] Meta AI | ||
washpost20250604.txt · Last modified: by Steve Isenberg