Revision as of 10:22, 30 December 2024

AI in Education

Survey/study of

2023-08: Perception, performance, and detectability of conversational artificial intelligence across 32 university courses
2023-10: Employees secretly using AI at work.
2023-10: Survey shows students using AI more than professors.
2023-11: ChatGPT has entered the classroom: how LLMs could transform education

AI improves learning/education

Mollick, Ethan R. and Mollick, Lilach and Bach, Natalie and Ciccarelli, LJ and Przystanski, Ben and Ravipinto, Daniel, AI Agents and Education: Simulated Practice at Scale (June 17, 2024). The Wharton School Research Paper. doi: 10.2139/ssrn.4871171
- Can enable personalized education.
Generative AI for Programming Education: Benchmarking ChatGPT, GPT-4, and Human Tutors
- GPT4 can out-perform human tutors.
Keppler, Samantha and Sinchaisri, Wichinpong and Snyder, Clare, Backwards Planning with Generative AI: Case Study Evidence from US K12 Teachers (August 13, 2024). doi: 10.2139/ssrn.4924786
- Teachers benefit from using AI as a co-pilot to aid in tasks (planning, how to teach topic, explore ideas).
- There is smaller utility in using AI purely as a text-generator (to make quizzes, workbooks, etc.).
Effective and Scalable Math Support: Evidence on the Impact of an AI- Tutor on Math Achievement in Ghana
AI Tutoring Outperforms Active Learning

AI harms learning

A real-world test of artificial intelligence infiltration of a university examinations system: A “Turing Test” case study
- Current grading systems cannot detect AI.
Bastani, Hamsa and Bastani, Osbert and Sungu, Alp and Ge, Haosen and Kabakcı, Özge and Mariman, Rei, Generative AI Can Harm Learning (July 15, 2024). The Wharton School Research Paper.doi: 10.2139/ssrn.4895486
- Access to ChatGPT harmed math education outcomes.
2024-09: AI Meets the Classroom: When Does ChatGPT Harm Learning?

Software/systems

GPTutor (code)
EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent Education
Eureka Labs (founded by Andrej Karpathy) aims to create AI-driven courses (first course is Intro to LLMs)

LLMs

2024-12: LearnLM: Improving Gemini for Learning

Individual tools

Chatbot (OpenAI ChatGPT, Anthropic Claude, Google Gemini)
NotebookLM: Enables one to "chat with documents".
Google Learn About

AI for grading

Can Large Language Models Make the Grade? An Empirical Study Evaluating LLMs Ability To Mark Short Answer Questions in K-12 Education (preprint)

Detection

Do teachers spot AI? Evaluating the detectability of AI-generated texts among student essays
- GenAI can simulate student writing in a way that teachers cannot detect.
- AI essays are assessed more positively than student-written.
- Teachers are overconfident in their source identification.
- Both novice and experienced teachers could not identify texts generated by ChatGPT vs. students

AI Text Detectors Don't Work

AI/human

AI out-performs humans

Professions

Humanity's Last Exam
- Effort to build a dataset of challenging (but resolvable) questions in specific domain areas, to act as a benchmark to test whether AIs are improving in these challenging topics.

Medical

2024-03: Influence of a Large Language Model on Diagnostic Reasoning: A Randomized Clinical Vignette Study
- GPT4 improves medical practitioner work; surprisingly, GPT4 alone scored better than a human with GPT4 as aid (on selected tasks).
2024-10: Perspectives on Artificial Intelligence–Generated Responses to Patient Messages
2024-10: Large Language Model Influence on Diagnostic Reasoning; A Randomized Clinical Trial
- Use of ChatGPT does not strongly improve medical expert work; but AI alone out-scores human or human+AI
2024-11: Large language models surpass human experts in predicting neuroscience results (writeup: AI can predict neuroscience study results better than human experts, study finds)
2024-12: Superhuman performance of a large language model on the reasoning tasks of a physician
2024-12: HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

AI improves human work

2023-07: Experimental evidence on the productivity effects of generative artificial intelligence
2023-09: Dell'Acqua, Fabrizio and McFowland III, Edward and Mollick, Ethan R. and Lifshitz-Assaf, Hila and Kellogg, Katherine and Rajendran, Saran and Krayer, Lisa and Candelon, François and Lakhani, Karim R., Navigating the Jagged Technological Frontier: Field Experimental Evidence of the Effects of AI on Knowledge Worker Productivity and Quality (September 15, 2023). Harvard Business School Technology & Operations Mgt. Unit Working Paper No. 24-013, The Wharton School Research Paper doi: 10.2139/ssrn.4573321
2023-11: Generative AI at Work (National Bureau of Economic Research)
2023-12: The Uneven Impact of Generative AI on Entrepreneurial Performance (doi: 10.31219/osf.io/hdjpk)
2023-12: Artificial Intelligence in the Knowledge Economy: Non-autonomous AI (chatbot) benefits least knowledgeable workers; autonomous agents benefit the most knowledgeable workers
2024-07: Generative AI in Real-World Workplaces: The Second Microsoft Report on AI and Productivity Research

Coding

2023-02: The Impact of AI on Developer Productivity: Evidence from GitHub Copilot
2024-09: Cui, Zheyuan and Demirer, Mert and Jaffe, Sonia and Musolff, Leon and Peng, Sida and Salz, Tobias, The Effects of Generative AI on High Skilled Work: Evidence from Three Field Experiments with Software Developers (September 03, 2024). doi: 10.2139/ssrn.4945566
2024-11: Hoffmann, Manuel and Boysel, Sam and Nagle, Frank and Peng, Sida and Xu, Kevin, Generative AI and the Nature of Work (October 27, 2024). Harvard Business School Strategy Unit Working Paper No. 25-021, Harvard Business Working Paper No. No. 25-021, doi: 10.2139/ssrn.5007084

Forecasting

2024-02: AI-Augmented Predictions: LLM Assistants Improve Human Forecasting Accuracy

Creativity

2024-07: Generative AI enhances individual creativity but reduces the collective diversity of novel content
2024-08: An empirical investigation of the impact of ChatGPT on creativity
2024-10: Human Creativity in the Age of LLMs
2024-11: Artificial Intelligence, Scientific Discovery, and Product Innovation: diffusion model increases "innovation" (patents), boosts the best performers, but also removes some enjoyable tasks.
2024-12: Using AI to Generate Visual Art: Do Individual Differences in Creativity Predict AI-Assisted Art Quality? (preprint): shows that more creative humans produce more creative genAI outputs

Counter loneliness

2024-07: AI Companions Reduce Loneliness

Human Perceptions of AI

AI passes Turing Test

Text Dialog

2023-05: Human or Not? A Gamified Approach to the Turing Test
2023-10: Does GPT-4 pass the Turing test?
2024-05: People cannot distinguish GPT-4 from a human in a Turing test
2024-07: GPT-4 is judged more human than humans in displaced and inverted Turing tests

Art

2024-11: How Did You Do On The AI Art Turing Test? Differentiation was only slightly above random (60%). AI art was often ranked higher than human-made.
2024-11: AI-generated poetry is indistinguishable from human-written poetry and is rated more favorably

Uptake

2023-07: ChatGPT: Early Adopters, Teething Issues and the Way Forward
2024-03: Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews
2024-05: Humlum, Anders and Vestergaard, Emilie, The Adoption of ChatGPT. IZA Discussion Paper No. 16992 doi: 10.2139/ssrn.4827166
2024-06: Kellogg, Katherine and Lifshitz-Assaf, Hila and Randazzo, Steven and Mollick, Ethan R. and Dell'Acqua, Fabrizio and McFowland III, Edward and Candelon, Francois and Lakhani, Karim R., Don't Expect Juniors to Teach Senior Professionals to Use Generative AI: Emerging Technology Risks and Novice AI Risk Mitigation Tactics (June 03, 2024). Harvard Business School Technology & Operations Mgt. Unit Working Paper 24-074, Harvard Business Working Paper No. 24-074, The Wharton School Research Paper doi: 10.2139/ssrn.4857373
2024-06: Delving into ChatGPT usage in academic writing through excess vocabulary
2024-09: The Rapid Adoption of Generative AI
2024-10: Growing Up: Navigating Generative AI’s Early Years – AI Adoption Report (executive summary, full report)
- 72% of leaders use genAI at least once a week (c.f. 23% in 2023); 90% agree AI enhances skills (c.f. 80% in 2023)
- Spending on genAI is up 130% (most companies plan to invest going forward)

Usage For

2024-12: Clio: A system for privacy-preserving insights into real-world AI use (Anthropic Clio)

@@ Line 74: / Line 74: @@
 ===Professions===
+* [https://agi.safe.ai/submit Humanity's Last Exam]
+** [https://x.com/alexandr_wang/status/1835738937719140440 Effort to build] a dataset of challenging (but resolvable) questions in specific domain areas, to act as a benchmark to test whether AIs are improving in these challenging topics.
+====Medical====
 * 2024-03: [https://www.medrxiv.org/content/10.1101/2024.03.12.24303785v1 Influence of a Large Language Model on Diagnostic Reasoning: A Randomized Clinical Vignette Study]
 ** GPT4 improves medical practitioner work; surprisingly, GPT4 alone scored better than a human with GPT4 as aid (on selected tasks).
@@ Line 81: / Line 85: @@
 * 2024-11: [https://www.nature.com/articles/s41562-024-02046-9 Large language models surpass human experts in predicting neuroscience results] (writeup: [https://medicalxpress.com/news/2024-11-ai-neuroscience-results-human-experts.html AI can predict neuroscience study results better than human experts, study finds])
 * 2024-12: [https://www.arxiv.org/abs/2412.10849 Superhuman performance of a large language model on the reasoning tasks of a physician]
-* [https://agi.safe.ai/submit Humanity's Last Exam]
+* 2024-12: [https://arxiv.org/abs/2412.18925 HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs]
-** [https://x.com/alexandr_wang/status/1835738937719140440 Effort to build] a dataset of challenging (but resolvable) questions in specific domain areas, to act as a benchmark to test whether AIs are improving in these challenging topics.
 ==AI improves human work==

Difference between revisions of "AI in education"

Revision as of 10:22, 30 December 2024

Contents

AI in Education

Survey/study of

AI improves learning/education

AI harms learning

Software/systems

LLMs

Individual tools

AI for grading

Detection

AI Text Detectors Don't Work

AI/human

AI out-performs humans

Tests

Creativity

Various

Professions

Medical

AI improves human work

Coding

Forecasting

Creativity

Counter loneliness

Human Perceptions of AI

AI passes Turing Test

Uptake

Usage For

See Also

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools