Revision as of 09:26, 28 July 2025

AI in Education

Survey/study of

2023-08: Perception, performance, and detectability of conversational artificial intelligence across 32 university courses
2023-10: Employees secretly using AI at work.
2023-10: Survey shows students using AI more than professors.
2023-11: ChatGPT has entered the classroom: how LLMs could transform education
2025-04: Anthropic Education Report: How University Students Use Claude
2025-05: The effect of ChatGPT on students’ learning performance, learning perception, and higher-order thinking: insights from a meta-analysis

AI improves learning/education

Mollick, Ethan R. and Mollick, Lilach and Bach, Natalie and Ciccarelli, LJ and Przystanski, Ben and Ravipinto, Daniel, AI Agents and Education: Simulated Practice at Scale (June 17, 2024). The Wharton School Research Paper. doi: 10.2139/ssrn.4871171
- Can enable personalized education.
Generative AI for Programming Education: Benchmarking ChatGPT, GPT-4, and Human Tutors
- GPT4 can out-perform human tutors.
Keppler, Samantha and Sinchaisri, Wichinpong and Snyder, Clare, Backwards Planning with Generative AI: Case Study Evidence from US K12 Teachers (August 13, 2024). doi: 10.2139/ssrn.4924786
- Teachers benefit from using AI as a co-pilot to aid in tasks (planning, how to teach topic, explore ideas).
- There is smaller utility in using AI purely as a text-generator (to make quizzes, workbooks, etc.).
Effective and Scalable Math Support: Evidence on the Impact of an AI- Tutor on Math Achievement in Ghana
AI Tutoring Outperforms Active Learning
From chalkboards to chatbots: Transforming learning in Nigeria, one prompt at a time (writeup)
- 6 weeks of after-school AI tutoring = 2 years of typical learning gains
- outperforms 80% of other educational interventions
AI Meets the Classroom: When Do Large Language Models Harm Learning?
- Outcomes depend on usage
LLM Support for Tutors GPT-4 boosts remote tutors’ performance in real time, study finds
- Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise
2025-06: Gallup & The Walton Foundation: Teaching for Tomorrow Unlocking Six Weeks a Year With AI

AI harms learning

A real-world test of artificial intelligence infiltration of a university examinations system: A “Turing Test” case study
- Current grading systems cannot detect AI.
Bastani, Hamsa and Bastani, Osbert and Sungu, Alp and Ge, Haosen and Kabakcı, Özge and Mariman, Rei, Generative AI Can Harm Learning (July 15, 2024). The Wharton School Research Paper.doi: 10.2139/ssrn.4895486
- Access to ChatGPT harmed math education outcomes.
2024-09: AI Meets the Classroom: When Does ChatGPT Harm Learning?

Software/systems

GPTutor (code)
EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent Education
Eureka Labs (founded by Andrej Karpathy) aims to create AI-driven courses (first course is Intro to LLMs)

LLMs

2024-12: LearnLM: Improving Gemini for Learning

Individual tools

Chatbot (OpenAI ChatGPT, Anthropic Claude, Google Gemini)
NotebookLM: Enables one to "chat with documents".
Google Learn About

Systems

Anthropic Claude for Education

AI for grading

Can Large Language Models Make the Grade? An Empirical Study Evaluating LLMs Ability To Mark Short Answer Questions in K-12 Education (preprint)

Detection

2024-06: Do teachers spot AI? Evaluating the detectability of AI-generated texts among student essays
- GenAI can simulate student writing in a way that teachers cannot detect.
- AI essays are assessed more positively than student-written.
- Teachers are overconfident in their source identification.
- Both novice and experienced teachers could not identify texts generated by ChatGPT vs. students
2025-01: People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text

AI Text Detectors Don't Work

AI/human

Capabilities

Writing

2022-12: Re3: Generating Longer Stories With Recursive Reprompting and Revision
2023-03: English essays: Artificial intelligence (AI) technology in OpenAI ChatGPT application: A review of ChatGPT in writing English essay
2023-01: Journalism: Collaborating With ChatGPT: Considering the Implications of Generative Artificial Intelligence for Journalism and Media Education
2023-07: Science writing: Artificial intelligence in scientific writing: a friend or a foe?
2024-02: Wikipedia style: Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models
2024-02: LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs (code)
2024-08: Scientific papers: [The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery]
2024-09: PaperQA2: Language Models Achieve Superhuman Synthesis of Scientific Knowledge (𝕏 post, code)
2025-03: WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation
2025-03: Learning to Reason for Long-Form Story Generation

AI out-performs humans

Tests

2023-07: SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models
2024-06: A real-world test of artificial intelligence infiltration of a university examinations system: A “Turing Test” case study
- AI scores higher than median students.

Creativity

See also: AI creativity
2023-07: Ideas Are Dimes A Dozen: Large Language Models For Idea Generation In Innovation
2023-09: Best humans still outperform artificial intelligence in a creative divergent thinking task
- Best humans out-perform AI at creativity. (By implication, median humans may not.)
2024-02: The current state of artificial intelligence generative language models is more creative than humans on divergent thinking tasks
2024-02: Felin, Teppo and Holweg, Matthias, Theory Is All You Need: AI, Human Cognition, and Causal Reasoning (February 24, 2024). doi: 10.2139/ssrn.4737265
- Argues that human "theory-based" creativity is better than AI "data-based".
2024-07: Pron vs Prompt: Can Large Language Models already Challenge a World-Class Fiction Author at Creative Text Writing?
- Top human (professional author) out-performs GPT4.
2024-09: Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers
- LLMs can be creative
2024-09: Creative and Strategic Capabilities of Generative AI: Evidence from Large-Scale Experiments
2025-06: Predicting Empirical AI Research Outcomes with Language Models

Art

Business & Marketing

Professions

Humanity's Last Exam
- Effort to build a dataset of challenging (but resolvable) questions in specific domain areas, to act as a benchmark to test whether AIs are improving in these challenging topics.

AI improves human work

2023-07: Experimental evidence on the productivity effects of generative artificial intelligence
2023-09: Dell'Acqua, Fabrizio and McFowland III, Edward and Mollick, Ethan R. and Lifshitz-Assaf, Hila and Kellogg, Katherine and Rajendran, Saran and Krayer, Lisa and Candelon, François and Lakhani, Karim R., Navigating the Jagged Technological Frontier: Field Experimental Evidence of the Effects of AI on Knowledge Worker Productivity and Quality (September 15, 2023). Harvard Business School Technology & Operations Mgt. Unit Working Paper No. 24-013, The Wharton School Research Paper doi: 10.2139/ssrn.4573321
2023-11: Generative AI at Work (National Bureau of Economic Research)
2023-12: The Uneven Impact of Generative AI on Entrepreneurial Performance (doi: 10.31219/osf.io/hdjpk)
2023-12: Artificial Intelligence in the Knowledge Economy: Non-autonomous AI (chatbot) benefits least knowledgeable workers; autonomous agents benefit the most knowledgeable workers
2024-07: Generative AI in Real-World Workplaces: The Second Microsoft Report on AI and Productivity Research
2025-03: The Cybernetic Teammate: A Field Experiment on Generative AI Reshaping Teamwork and Expertise
- 2025-03: Ethan Mollick: Cybernetic Teammate: Having an AI on your team can increase performance, provide expertise, and improve your experience

Coding

2023-02: The Impact of AI on Developer Productivity: Evidence from GitHub Copilot
2024-09: Cui, Zheyuan and Demirer, Mert and Jaffe, Sonia and Musolff, Leon and Peng, Sida and Salz, Tobias, The Effects of Generative AI on High Skilled Work: Evidence from Three Field Experiments with Software Developers (September 03, 2024). doi: 10.2139/ssrn.4945566
2024-11: Hoffmann, Manuel and Boysel, Sam and Nagle, Frank and Peng, Sida and Xu, Kevin, Generative AI and the Nature of Work (October 27, 2024). Harvard Business School Strategy Unit Working Paper No. 25-021, Harvard Business Working Paper No. No. 25-021, doi: 10.2139/ssrn.5007084

Forecasting

2024-02: AI-Augmented Predictions: LLM Assistants Improve Human Forecasting Accuracy

Finance

2024-12: AI, Investment Decisions, and Inequality: Novices see improvements in investment performance, sophisticated investors see even greater improvements.

Law

2025-03: AI-Powered Lawyering: AI Reasoning Models, Retrieval Augmented Generation, and the Future of Legal Practice

Medical

Translation

2025-01: A professional workflow for translation using LLMs (based on this)

Customer service

2023-11: Generative AI at Work: Improvements for workers and clients (though also a ceiling to improvement)

Creativity

See also: AI creativity
2024-07: Generative AI enhances individual creativity but reduces the collective diversity of novel content
2024-08: An empirical investigation of the impact of ChatGPT on creativity
- 2024-08: Response: ChatGPT decreases idea diversity in brainstorming (pdf)
- 2025-05: Response: Reply to: ChatGPT decreases idea diversity in brainstorming
2024-08: The Crowdless Future? Generative AI and Creative Problem-Solving
2024-10: Human Creativity in the Age of LLMs
2024-11: ~~Artificial Intelligence, Scientific Discovery, and Product Innovation~~: diffusion model increases "innovation" (patents), boosts the best performers, but also removes some enjoyable tasks.
- 2025-05: Retraction: Assuring an accurate research record
2024-12: Using AI to Generate Visual Art: Do Individual Differences in Creativity Predict AI-Assisted Art Quality? (preprint): shows that more creative humans produce more creative genAI outputs
2025-01: One Does Not Simply Meme Alone: Evaluating Co-Creativity Between LLMs and Humans in the Generation of Humor

Equity

2025-01: Using Large Language Models to Promote Health Equity

AI worse than humans

AI lowers human productivity

2025-07: METR: Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity (blog, commentary/analysis)

Human Perceptions of AI

AI passes Turing Test

Text Dialog

2023-05: Human or Not? A Gamified Approach to the Turing Test
2023-10: Does GPT-4 pass the Turing test?
2024-05: People cannot distinguish GPT-4 from a human in a Turing test
2024-07: GPT-4 is judged more human than humans in displaced and inverted Turing tests
2025-03: Large Language Models Pass the Turing Test
2025-04: A Minimal Turing Test

Art

2024-11: How Did You Do On The AI Art Turing Test? Differentiation was only slightly above random (60%). AI art was often ranked higher than human-made.
2024-11: AI-generated poetry is indistinguishable from human-written poetry and is rated more favorably

Uptake

2023-07: ChatGPT: Early Adopters, Teething Issues and the Way Forward
2024-03: Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews
2024-05: Humlum, Anders and Vestergaard, Emilie, The Adoption of ChatGPT. IZA Discussion Paper No. 16992 doi: 10.2139/ssrn.4827166
2024-06: Kellogg, Katherine and Lifshitz-Assaf, Hila and Randazzo, Steven and Mollick, Ethan R. and Dell'Acqua, Fabrizio and McFowland III, Edward and Candelon, Francois and Lakhani, Karim R., Don't Expect Juniors to Teach Senior Professionals to Use Generative AI: Emerging Technology Risks and Novice AI Risk Mitigation Tactics (June 03, 2024). Harvard Business School Technology & Operations Mgt. Unit Working Paper 24-074, Harvard Business Working Paper No. 24-074, The Wharton School Research Paper doi: 10.2139/ssrn.4857373
2024-06: Delving into ChatGPT usage in academic writing through excess vocabulary
2024-09: The Rapid Adoption of Generative AI
2024-10: Growing Up: Navigating Generative AI’s Early Years – AI Adoption Report (executive summary, full report)
- 72% of leaders use genAI at least once a week (c.f. 23% in 2023); 90% agree AI enhances skills (c.f. 80% in 2023)
- Spending on genAI is up 130% (most companies plan to invest going forward)
2024-12: The unequal adoption of ChatGPT exacerbates existing inequalities among workers
- Higher adoption among young and less experienced
- Lower adoption among women and lower-earning workers
2025-02: The Widespread Adoption of Large Language Model-Assisted Writing Across Society: 10-25% adoption across a range of contexts
2025-02: Local Heterogeneity in Artificial Intelligence Jobs Over Time and Space
2025-04: Demand for LLMs: Descriptive Evidence on Substitution, Market Expansion, and Multihoming
2025-05: ChatGPT Is Still Leading the AI Wars but Google Gemini Is Gaining Ground
2025-05: Large Language Models, Small Labor Market Effects
- Significant uptake, but very little economic impact so far
2025-05: The Labor Market Effects of Generative Artificial Intelligence
- US worker usage of AI increasingly rapidly: 30% in 2024-12; 40% in 2025-05
2025-05: Trends – Artificial Intelligence
2025-06: Who is using AI to code? Global diffusion and impact of generative AI
2025-06: 2025 State of AI Report: The Builder’s Playbook A Practical Roadmap for AI Innovation
2025-07: METR: After the ChatGPT Moment: Measuring AI’s Adoption How quickly has AI been diffusing through the economy?
2025-07: Pew Research: 34% of U.S. adults have used ChatGPT, about double the share in 2023

Usage For

2024-12: Clio: A system for privacy-preserving insights into real-world AI use (Anthropic Clio)
2025-03: How People are Really Using Generative AI Now (writeup)
2025-04: Anthropic Education Report: How University Students Use Claude

Hiding Usage

2025-05: Underreporting of AI use: The role of social desirability bias

Psychological Impact

Human Sentiment towards AI

2025-04: Pew Research: How the U.S. Public and AI Experts View Artificial Intelligence

AI Persuasion of Humans

(AI can update beliefs, change opinions, tackle conspiracy theories, etc.)

AI Effects on Human Psychology

Human well-being

Counter loneliness

2024-07: AI Companions Reduce Loneliness
2025-03: How AI and Human Behaviors Shape Psychosocial Effects of Chatbot Use: A Longitudinal Controlled Study
2025-06: Anthropic: How People Use Claude for Support, Advice, and Companionship

@@ Line 26: / Line 26: @@
 * [https://www.deeplearning.ai/the-batch/gpt-4-boosts-remote-tutors-performance-in-real-time-study-finds/ LLM Support for Tutors GPT-4 boosts remote tutors’ performance in real time, study finds]
 ** [https://arxiv.org/abs/2410.03017 Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise]
+* 2025-06: Gallup & The Walton Foundation: [https://www.gallup.com/file/analytics/691922/Walton-Family-Foundation-Gallup-Teachers-AI-Report.pdf Teaching for Tomorrow Unlocking Six Weeks a Year With AI]
 ==AI harms learning==
@@ Line 138: / Line 139: @@
 * 2025-04: [https://www.nature.com/articles/s41586-025-08869-4?linkId=13898054 Towards accurate differential diagnosis with large language models]
 * 2025-06: [https://www.medrxiv.org/content/10.1101/2025.06.13.25329541v1 Automation of Systematic Reviews with Large Language Models]
+* 2025-06: [https://microsoft.ai/new/the-path-to-medical-superintelligence/ The Path to Medical Superintelligence]
 ====Bio====
@@ Line 181: / Line 183: @@
 * 2025-06: [https://www.medrxiv.org/content/10.1101/2025.06.07.25329176v1 From Tool to Teammate: A Randomized Controlled Trial of Clinician-AI Collaborative Workflows for Diagnosis]
 * 2025-06: [https://bmcmededuc.biomedcentral.com/articles/10.1186/s12909-025-07414-1 Iteratively refined ChatGPT outperforms clinical mentors in generating high-quality interprofessional education clinical scenarios: a comparative study]
+* 2025-07: [https://cdn.openai.com/pdf/a794887b-5a77-4207-bb62-e52c900463f1/penda_paper.pdf AI-based Clinical Decision Support for Primary Care: A Real-World Study] ([https://openai.com/index/ai-clinical-copilot-penda-health/ blog])
+* 2025-07: [https://arxiv.org/abs/2507.15743 Towards physician-centered oversight of conversational diagnostic AI]
 ===Translation===
@@ Line 208: / Line 212: @@
 * 2025-04: [https://arxiv.org/abs/2504.18919 Clinical knowledge in LLMs does not translate to human interactions]
 * 2025-05: [https://royalsocietypublishing.org/doi/10.1098/rsos.241776 Generalization bias in large language model summarization of scientific research]
+==AI lowers human productivity==
+* 2025-07: METR: [https://metr.org/Early_2025_AI_Experienced_OS_Devs_Study.pdf Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity] ([https://metr.org/blog/2025-07-10-early-2025-ai-experienced-os-dev-study/ blog], [https://secondthoughts.ai/p/ai-coding-slowdown commentary/analysis])
 ==Human Perceptions of AI==
@@ Line 250: / Line 257: @@
 * 2025-05: [https://www.bondcap.com/report/pdf/Trends_Artificial_Intelligence.pdf Trends – Artificial Intelligence]
 * 2025-06: [https://arxiv.org/abs/2506.08945 Who is using AI to code? Global diffusion and impact of generative AI]
+* 2025-06: [https://www.iconiqcapital.com/growth/reports/2025-state-of-ai 2025 State of AI Report: The Builder’s Playbook] A Practical Roadmap for AI Innovation
+* 2025-07: METR: [https://epochai.substack.com/p/after-the-chatgpt-moment-measuring After the ChatGPT Moment: Measuring AI’s Adoption How quickly has AI been diffusing through the economy?]
+* 2025-07: Pew Research: [https://www.pewresearch.org/short-reads/2025/06/25/34-of-us-adults-have-used-chatgpt-about-double-the-share-in-2023/ 34% of U.S. adults have used ChatGPT, about double the share in 2023]
 ==Usage For==
@@ Line 273: / Line 283: @@
 ** [https://www.404media.co/researchers-secretly-ran-a-massive-unauthorized-ai-persuasion-experiment-on-reddit-users/ Researchers Secretly Ran a Massive, Unauthorized AI Persuasion Experiment on Reddit Users]
 * 2025-05: [https://arxiv.org/abs/2505.09662 Large Language Models Are More Persuasive Than Incentivized Human Persuaders]
+* 2025-07: [https://arxiv.org/abs/2507.13919 The Levers of Political Persuasion with Conversational AI]
 ==AI Effects on Human Psychology==
@@ Line 283: / Line 294: @@
 * 2024-07: [https://arxiv.org/abs/2407.19096 AI Companions Reduce Loneliness]
 * 2025-03: [https://dam-prod2.media.mit.edu/x/2025/03/21/Randomized_Control_Study_on_Chatbot_Psychosocial_Effect.pdf How AI and Human Behaviors Shape Psychosocial Effects of Chatbot Use: A Longitudinal Controlled Study]
+* 2025-06: Anthropic: [https://www.anthropic.com/news/how-people-use-claude-for-support-advice-and-companionship How People Use Claude for Support, Advice, and Companionship]
 ===Human mental abilities (creativity, learning)===
@@ Line 300: / Line 312: @@
 * 2025-04: [https://www.nber.org/papers/w33662 Measuring Human Leadership Skills with AI Agents]
 * 2025-04: [https://arxiv.org/abs/2504.10157 SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users]
+* 2025-07: [https://www.nature.com/articles/s41586-025-09215-4 A foundation model to predict and capture human cognition] ([https://marcelbinz.github.io/centaur code])
+* 2025-07: [https://arxiv.org/abs/2507.15815 LLM Economist: Large Population Models and Mechanism Design in Multi-Agent Generative Simulacra]
 =See Also=

Difference between revisions of "AI and Humans"

Revision as of 09:26, 28 July 2025

Contents

AI in Education

Survey/study of

AI improves learning/education

AI harms learning

Software/systems

LLMs

Individual tools

Systems

AI for grading

Detection

AI Text Detectors Don't Work

AI/human

Capabilities

Writing

AI out-performs humans

Tests

Creativity

Art

Business & Marketing

Professions

Coding

Medical

Bio

Therapy

Financial

AI improves human work

Coding

Forecasting

Finance

Law

Medical

Translation

Customer service

Creativity

Equity

AI worse than humans

AI lowers human productivity

Human Perceptions of AI

AI passes Turing Test

Uptake

Usage For

Hiding Usage

Psychological Impact

Human Sentiment towards AI

AI Persuasion of Humans

AI Effects on Human Psychology

Human well-being

Counter loneliness

Human mental abilities (creativity, learning)

Simulate Humans

See Also

Navigation menu

Search