Difference between revisions of "AI and Humans"

From GISAXS
(AI worse than humans)
(Simulate Humans)
 
(15 intermediate revisions by the same user not shown)
Line 140: Line 140:
 
* 2025-06: [https://www.medrxiv.org/content/10.1101/2025.06.13.25329541v1 Automation of Systematic Reviews with Large Language Models]
 
 
* 2025-06: [https://microsoft.ai/new/the-path-to-medical-superintelligence/ The Path to Medical Superintelligence]
 
 +
* 2025-08: [https://www.nature.com/articles/s41591-025-03888-0?utm_source=chatgpt.com A personal health large language model for sleep and fitness coaching]
 +
* 2025-08: [https://arxiv.org/abs/2508.08224 Capabilities of GPT-5 on Multimodal Medical Reasoning]
  
 
====Bio====
 
Line 152: Line 154:
 
====Financial====
 
 
* 2024-07: [https://arxiv.org/abs/2407.17866 Financial Statement Analysis with Large Language Models]
 
 +
 +
====HR====
 +
* 2025-08: [https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5395709 Voice AI in Firms: A Natural Field Experiment on Automated Job Interviews]
  
 
==AI improves human work==
 
Line 183: Line 188:
 
* 2025-06: [https://www.medrxiv.org/content/10.1101/2025.06.07.25329176v1 From Tool to Teammate: A Randomized Controlled Trial of Clinician-AI Collaborative Workflows for Diagnosis]
 
 
* 2025-06: [https://bmcmededuc.biomedcentral.com/articles/10.1186/s12909-025-07414-1 Iteratively refined ChatGPT outperforms clinical mentors in generating high-quality interprofessional education clinical scenarios: a comparative study]
 
 +
* 2025-07: [https://cdn.openai.com/pdf/a794887b-5a77-4207-bb62-e52c900463f1/penda_paper.pdf AI-based Clinical Decision Support for Primary Care: A Real-World Study] ([https://openai.com/index/ai-clinical-copilot-penda-health/ blog])
 +
* 2025-07: [https://arxiv.org/abs/2507.15743 Towards physician-centered oversight of conversational diagnostic AI]
  
 
===Translation===
 
Line 256: Line 263:
 
* 2025-06: [https://arxiv.org/abs/2506.08945 Who is using AI to code? Global diffusion and impact of generative AI]
 
 
* 2025-06: [https://www.iconiqcapital.com/growth/reports/2025-state-of-ai 2025 State of AI Report: The Builder’s Playbook] A Practical Roadmap for AI Innovation
 
 +
* 2025-07: Epoch AI: [https://epochai.substack.com/p/after-the-chatgpt-moment-measuring After the ChatGPT Moment: Measuring AI’s Adoption] How quickly has AI been diffusing through the economy?
 +
* 2025-06: Pew Research: [https://www.pewresearch.org/short-reads/2025/06/25/34-of-us-adults-have-used-chatgpt-about-double-the-share-in-2023/ 34% of U.S. adults have used ChatGPT, about double the share in 2023]
  
 
==Usage For==
 
Line 264: Line 273:
 
==Hiding Usage==
 
 
* 2025-05: [https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5232910 Underreporting of AI use: The role of social desirability bias]
 
 +
 +
=Societal Effects/Transformations=
 +
* 2025-09: [https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5425555 Generative AI as Seniority-Biased Technological Change: Evidence from U.S. Résumé and Job Posting Data]
  
 
=Psychological Impact=
 
 +
* 2025-08: [https://arxiv.org/abs/2508.16628 The Impact of Artificial Intelligence on Human Thought]
 +
 
==Human Sentiment towards AI==
 
 
* 2025-04: Pew Research: [https://www.pewresearch.org/internet/2025/04/03/how-the-us-public-and-ai-experts-view-artificial-intelligence/ How the U.S. Public and AI Experts View Artificial Intelligence]
 
Line 279: Line 293:
 
** [https://www.404media.co/researchers-secretly-ran-a-massive-unauthorized-ai-persuasion-experiment-on-reddit-users/ Researchers Secretly Ran a Massive, Unauthorized AI Persuasion Experiment on Reddit Users]
 
 
* 2025-05: [https://arxiv.org/abs/2505.09662 Large Language Models Are More Persuasive Than Incentivized Human Persuaders]
 
 +
* 2025-07: [https://arxiv.org/abs/2507.13919 The Levers of Political Persuasion with Conversational AI]
  
 
==AI Effects on Human Psychology==
 
Line 297: Line 312:
 
=Simulate Humans=
 
 
* See also: [[Human brain]]
 
 +
 +
==Sociology==
 
* 2021-10: [https://www.doi.org/10.1007/s10588-021-09351-y Explaining and predicting human behavior and social dynamics in simulated virtual worlds: reproducibility, generalizability, and robustness of causal discovery methods]
 
 
* 2023-12: Google: [https://arxiv.org/abs/2312.03664 Generative agent-based modeling with actions grounded in physical, social, or digital space using Concordia]
 
Line 308: Line 325:
 
* 2025-04: [https://arxiv.org/abs/2504.10157 SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users]
 
 
* 2025-07: [https://www.nature.com/articles/s41586-025-09215-4 A foundation model to predict and capture human cognition] ([https://marcelbinz.github.io/centaur code])
 
 +
* 2025-07: [https://arxiv.org/abs/2507.15815 LLM Economist: Large Population Models and Mechanism Design in Multi-Agent Generative Simulacra]
 +
* 2025-09: [https://benjaminmanning.io/files/optimize.pdf General Social Agents]
 +
 +
==Theory of Mind==
 +
* 2025-08: [https://www.nature.com/articles/s44387-025-00031-9 How large language models encode theory-of-mind: a study on sparse parameter patterns]
 +
 +
==Humanlike Vibes==
 +
* 2025-07: [https://arxiv.org/abs/2507.20525 The Xeno Sutra: Can Meaning and Value be Ascribed to an AI-Generated "Sacred" Text?]
 +
 +
==Skeptical==
 +
* 2025-08: [https://arxiv.org/abs/2508.06950 Large Language Models Do Not Simulate Human Psychology]
  
 
=See Also=
 

Latest revision as of 15:57, 4 September 2025

AI in Education

Survey/study of

AI improves learning/education

AI harms learning

Software/systems

LLMs

Individual tools

Systems

AI for grading

Detection

AI Text Detectors Don't Work

AI/human

Capabilities

Writing

AI out-performs humans

Tests

Creativity

Art

Business & Marketing

Professions

  • Humanity's Last Exam
    • Effort to build a dataset of challenging (but resolvable) questions in specific domain areas, to act as a benchmark to test whether AIs are improving in these challenging topics.

Coding

Medical

Bio

Therapy

Financial

HR

AI improves human work

Coding

Forecasting

Finance

Law

Medical

Translation

Customer service

  • 2023-11: Generative AI at Work: Improvements for workers and clients (though also a ceiling to improvement)

Creativity

Equity

AI worse than humans

AI lowers human productivity

Human Perceptions of AI

AI passes Turing Test

Text Dialog

Art

Uptake

Usage For

Hiding Usage

Societal Effects/Transformations

Psychological Impact

Human Sentiment towards AI

AI Persuasion of Humans

(AI can update beliefs, change opinions, tackle conspiracy theories, etc.)

AI Effects on Human Psychology

Human well-being

Counter loneliness

Human mental abilities (creativity, learning)

Simulate Humans

Sociology

Theory of Mind

Humanlike Vibes

Skeptical

See Also