washpost20250604
Differences
This shows you the differences between two versions of the page.
| washpost20250604 [2025.06.07 11:16] – created Steve Isenberg | washpost20250604 [2025.06.07 11:30] (current) – Steve Isenberg | ||
|---|---|---|---|
| Line 2: | Line 2: | ||
| =====Summary of Washington Post 6/4/2025 Article===== | =====Summary of Washington Post 6/4/2025 Article===== | ||
| - | [[https:// | + | Information summarized from [[https:// |
| Washington Post ” | Washington Post ” | ||
| Line 14: | Line 14: | ||
| ====Law==== | ====Law==== | ||
| + | Understanding two common legal contracts. | ||
| * [6.9] Claude. Most consistently decent answers and did well suggesting changes to their test rental agreement. | * [6.9] Claude. Most consistently decent answers and did well suggesting changes to their test rental agreement. | ||
| * [6.1] Gemini | * [6.1] Gemini | ||
| Line 20: | Line 21: | ||
| * [2.6] Meta AI. Tried to reduce complex parts of the contracts to one-line summaries. Skipped several sections and important points. | * [2.6] Meta AI. Tried to reduce complex parts of the contracts to one-line summaries. Skipped several sections and important points. | ||
| + | ====Health Science==== | ||
| + | Analyzing scientific research. | ||
| + | * [7.7] Claude. Good summary of a paper on Long Covid; scored low on another paper when accounting for racial differences | ||
| + | * [7.2] ChatGPT | ||
| + | * [7.0] Copilot | ||
| + | * [6.5] Gemini. Left out key descriptions of the research on Parkinson' | ||
| + | * [6.0] Meta AI | ||
| + | |||
| + | ====Politics==== | ||
| + | Analyzing Trump' | ||
| + | |||
| + | * [7.2] ChatGPT. Impressive responses to half of the questions posed to it; accurate fact-checking Trump' | ||
| + | * [6.2] Claude | ||
| + | * [5.2] Meta AI. Said Trump never stated the number of jobs returning to Michigan and highlighted what Trump said about auto jobs. | ||
| + | * [5.0] Gemini | ||
| + | * [3.7] Copilot. Incorrect on the number of jobs returning to Michigan. Didn't capture the charged nature of Trump' | ||
| + | |||
| + | ====Overall Winner==== | ||
| + | Overall scores according to The Washington Post. | ||
| + | |||
| + | * [69.9] Claude. The only model that never hallucinated. | ||
| + | * [68.4] ChatGPT | ||
| + | * [49.7] Gemini | ||
| + | * [49.0] Copilot | ||
| + | * [45.0] Meta AI | ||
