Difference between revisions of "AI and Humans"

From GISAXS
(Survey/study of)
(Creativity)
 
(10 intermediate revisions by the same user not shown)
Line 19: Line 19:
 
* [https://arxiv.org/abs/2402.09809 Effective and Scalable Math Support: Evidence on the Impact of an AI-Tutor on Math Achievement in Ghana]
 
* [https://doi.org/10.21203/rs.3.rs-4243877/v1 AI Tutoring Outperforms Active Learning]
* [https://blogs.worldbank.org/en/education/From-chalkboards-to-chatbots-Transforming-learning-in-Nigeria From chalkboards to chatbots: Transforming learning in Nigeria, one prompt at a time]
+
* [https://documents.worldbank.org/en/publication/documents-reports/documentdetail/099548105192529324 From chalkboards to chatbots: Transforming learning in Nigeria, one prompt at a time] ([https://blogs.worldbank.org/en/education/From-chalkboards-to-chatbots-Transforming-learning-in-Nigeria writeup])
 
** 6 weeks of after-school AI tutoring = 2 years of typical learning gains
 
** outperforms 80% of other educational interventions
Line 86: Line 86:
  
 
===Creativity===
 +
* See also: [[AI creativity]]
 
* 2023-07: [https://mackinstitute.wharton.upenn.edu/wp-content/uploads/2023/08/LLM-Ideas-Working-Paper.pdf Ideas Are Dimes A Dozen: Large Language Models For Idea Generation In Innovation]
 
* 2023-09: [https://www.nature.com/articles/s41598-023-40858-3 Best humans still outperform artificial intelligence in a creative divergent thinking task]
Line 175: Line 176:
 
* 2025-03: [https://www.medrxiv.org/content/10.1101/2025.02.28.25323115v1.full Medical Hallucination in Foundation Models and Their Impact on Healthcare]
 
* 2025-03: [https://journals.lww.com/international-journal-of-surgery/fulltext/2025/03000/chatgpt_s_role_in_alleviating_anxiety_in_total.20.aspx ChatGPT’s role in alleviating anxiety in total knee arthroplasty consent process: a randomized controlled trial pilot study]
 +
* 2025-05: [https://openai.com/index/healthbench/ Introducing HealthBench]
  
 
===Translation===
Line 183: Line 185:
  
 
===Creativity===
 +
* See also: [[AI creativity]]
 
* 2024-07: [https://www.science.org/doi/10.1126/sciadv.adn5290 Generative AI enhances individual creativity but reduces the collective diversity of novel content]
 
* 2024-08: [https://www.nature.com/articles/s41562-024-01953-1 An empirical investigation of the impact of ChatGPT on creativity]
 +
** 2024-08: Response: [https://www.nature.com/articles/s41562-024-01953-1 ChatGPT decreases idea diversity in brainstorming] ([https://www.nature.com/articles/s41562-025-02173-x.epdf?sharing_token=LA9NyDHj7y5WN8zvb5Qm49RgN0jAjWel9jnR3ZoTv0Nl8PrpXFkjZ93XvmUVBgB9Hlfro5Yo6YELr-pRqbpk3HaZENCvsfV8G1kwtTEj2oW1g87dSVT4BzrfCu3jS_606SLzmoDuDiALChY-MozVM4Pj1b4Vdf-YaIH5p3lfAnM%3D pdf])
 +
** 2025-05: Response: [https://www.nature.com/articles/s41562-025-02195-5 Reply to: ChatGPT decreases idea diversity in brainstorming]
 
* 2024-08: [https://doi.org/10.1287/orsc.2023.18430 The Crowdless Future? Generative AI and Creative Problem-Solving]
 
* 2024-10: [https://arxiv.org/abs/2410.03703 Human Creativity in the Age of LLMs]
* 2024-11: [https://conference.nber.org/conf_papers/f210475.pdf Artificial Intelligence, Scientific Discovery, and Product Innovation]: diffusion model increases "innovation" (patents), boosts the best performers, but also removes some enjoyable tasks.
+
* 2024-11: <strike>[https://conference.nber.org/conf_papers/f210475.pdf Artificial Intelligence, Scientific Discovery, and Product Innovation]</strike>: diffusion model increases "innovation" (patents), boosts the best performers, but also removes some enjoyable tasks.
 +
** 2025-05: Retraction: [https://economics.mit.edu/news/assuring-accurate-research-record Assuring an accurate research record]
 
* 2024-12: [https://doi.org/10.1080/10400419.2024.2440691 Using AI to Generate Visual Art: Do Individual Differences in Creativity Predict AI-Assisted Art Quality?] ([https://osf.io/preprints/psyarxiv/ygzw6 preprint]): shows that more creative humans produce more creative genAI outputs
 
* 2025-01: [https://arxiv.org/abs/2501.11433 One Does Not Simply Meme Alone: Evaluating Co-Creativity Between LLMs and Humans in the Generation of Humor]
Line 200: Line 206:
 
==AI worse than humans==
 
* 2025-04: [https://spinup-000d1a-wp-offload-media.s3.amazonaws.com/faculty/wp-content/uploads/sites/27/2025/03/AI-debt-collection-20250331.pdf How Good is AI at Twisting Arms? Experiments in Debt Collection]
 +
* 2025-04: [https://arxiv.org/abs/2504.18919 Clinical knowledge in LLMs does not translate to human interactions]
 +
* 2025-05: [https://royalsocietypublishing.org/doi/10.1098/rsos.241776 Generalization bias in large language model summarization of scientific research]
  
 
==Human Perceptions of AI==
Line 241: Line 249:
 
* 2025-04: [https://andreyfradkin.com/assets/demandforllm.pdf Demand for LLMs: Descriptive Evidence on Substitution, Market Expansion, and Multihoming]
 
* 2025-05: [https://civicscience.com/chatgpt-is-still-leading-the-ai-wars-but-google-gemini-is-gaining-ground/ ChatGPT Is Still Leading the AI Wars but Google Gemini Is Gaining Ground]
 +
* 2025-05: [https://www.nber.org/papers/w33777 Large Language Models, Small Labor Market Effects]
 +
** Significant uptake, but very little economic impact so far
 +
* 2025-05: [https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5136877 The Labor Market Effects of Generative Artificial Intelligence]
 +
** US worker usage of AI increasing rapidly: 30% in 2024-12; 40% in 2025-05
  
 
==Usage For==
Line 262: Line 274:
 
* 2025-04: [https://drive.google.com/file/d/1Eo4SHrKGPErTzL1t_QmQhfZGU27jKBjx/edit Can AI Change Your View? Evidence from a Large-Scale Online Field Experiment]
 
** [https://www.404media.co/researchers-secretly-ran-a-massive-unauthorized-ai-persuasion-experiment-on-reddit-users/ Researchers Secretly Ran a Massive, Unauthorized AI Persuasion Experiment on Reddit Users]
 +
* 2025-05: [https://arxiv.org/abs/2505.09662 Large Language Models Are More Persuasive Than Incentivized Human Persuaders]
  
 
=Simulate Humans=

Latest revision as of 09:35, 28 May 2025

AI in Education

Survey/study of

AI improves learning/education

AI harms learning

Software/systems

LLMs

Individual tools

Systems

AI for grading

Detection

AI Text Detectors Don't Work

AI/human

Capabilities

Writing

AI out-performs humans

Tests

Creativity

Art

Business & Marketing

Professions

  • Humanity's Last Exam
    • Effort to build a dataset of challenging (but resolvable) questions in specific domain areas, to act as a benchmark to test whether AIs are improving in these challenging topics.

Coding

Medical

Bio

Therapy

Financial

AI improves human work

Coding

Forecasting

Finance

Law

Medical

Translation

Customer service

  • 2023-11: Generative AI at Work: Improvements for workers and clients (though also a ceiling to improvement)

Creativity

Equity

Counter loneliness

AI worse than humans

Human Perceptions of AI

AI passes Turing Test

Text Dialog

Art

Psychological Effects of AI Usage

Uptake

Usage For

Hiding Usage

Sentiment

Persuasion

(AI can update beliefs, change opinions, tackle conspiracy theories, etc.)

Simulate Humans

See Also