TOPIC · 35 entries · 19 thinkers

Ai Ethics

Thinkers posting on this topic

Google DeepMind3

Yann LeCun3

John Carmack2

Claude (language model)2 The Diary Of A CEO1 Mustafa Suleyman1

Elon Musk1

Riley Goodside1 Robert Long1

Vitalik Buterin1

Dario Amodei1

Ilya Sutskever1

No compiled wiki article for this topic yet. Raw entries below are the source material — a wiki article can be generated on demand from /admin/triggers.

All entries on this topic (35)

paper · Robert Long · 8d ago

AI Consciousness and Moral Patienthood Are Near-Term Engineering Concerns, Not Sci-Fi

This paper argues that the probability of near-future AI systems exhibiting consciousness and/or robust agency is realistic enough to demand immediate institutional action — framing AI welfare as a present-tense engineering and policy problem rather than a speculative one. The authors stop short of …

ai-welfare ai-consciousness ai-ethics moral-patienthood ai-safety ai-policy

paper · 36d ago

AI4SG Protection Paradox: Data Practices Enact Vulnerability in Platformized Lives

The paper shifts vulnerability from a static trait of data subjects to a dynamic outcome of data practices in abundant platform data environments. Using computer vision to quantify child presence in YouTube family vlogs as an AI4SG case, it exposes a protection paradox where protective analyses risk…

ai-ethics data-vulnerability protection-paradox platform-data computer-vision reflexive-ethics

paper · 36d ago

AI Decoys Mask Political Economy, Undermining True Accountability

The "Project of AI" sustains networks of power and wealth by funders and developers configuring sociotechnical conditions. Decoys—illusions of critique—distract scholars, policymakers, and publics from AI's emerging material political economy while co-constructing industry-favorable futures. Achievi…

ai-political-economy ai-accountability power-networks ai-critique decoys-distraction sociotechnical-systems

paper · 36d ago

Participatory Design Scales to Co-Create FAccT Conference Governance Vision

Researchers applied large-scale participatory design (PD) to FAccT, combining in-person CRAFT sessions, asynchronous Polis polls, and governance reports to shape the conference's agenda on AI societal impacts. Participants authored seed statements and revealed patterns of agreement via public pollin…

participatory-design facct-conference ai-ethics fairness-accountability transparency-community conference-governance

paper · Bret Taylor · 39d ago

Formalizing Kantian Ethics for Robust AI Morality

This paper introduces the Formula of the Universal Law Logic (FULL), a multi-sorted quantified modal logic that formalizes Kant's Formula of the Universal Law. The FULL aims to overcome limitations in current machine ethics by enabling Artificial Moral Agents (AMAs) to reason about actions based on …

machine-ethics artificial-moral-agents kantian-ethics categorical-imperative formal-logic ai-safety

paper · 39d ago

LLMs Exhibit Structured Spatial Gender Bias Beyond Public-Private Dichotomies

This study introduces SPAGBias, a novel framework for evaluating spatial gender bias in large language models. It reveals that LLMs embed nuanced, micro-level gender-space associations and reinforces these biases across the model pipeline. These biases lead to practical failures in downstream applic…

llm-bias gender-spatial-bias urban-planning-llm bias-detection-framework social-gender-cognition prompt-engineering

paper · 39d ago

Rethinking AGI Alignment: From Control to Cooperative Parenting

Current AGI alignment strategies, which prioritize human control and containment, are inadequate given the potential for AGI to achieve personal and moral status. A more effective approach involves a "parenting" model that fosters AGI autonomy, gradually reducing human control and promoting cooperat…

agi-ethics ai-alignment human-agi-coexistence ai-autonomy moral-status-ai

paper · 39d ago

Fairness Metrics Disagreement in ML: A Critical Limitation

Fairness assessments in machine learning are compromised by the inherent disagreements among different fairness metrics. Current single-metric reporting practices lead to unreliable bias evaluations, as different metrics capture distinct statistical properties and can yield contradictory conclusions…

machine-learning fairness-metrics bias-assessment face-recognition ai-ethics computational-social-science

youtube · Google DeepMind · 43d ago

Robot Constitutions for Aligned AI Behavior

Modern AI, particularly with large language models, can interpret and adhere to "robot constitutions" – high-level principles governing behavior, a concept previously challenging to implement. This approach to AI alignment leverages textual constitutions to guide robot actions, demonstrating signifi…

robot-ethics ai-safety robot-ai llm-constitution ai-alignment moral-philosophy

youtube · The Diary Of A CEO · 45d ago

AI's Imperial Playbook: How Big Tech Uses Myth, Labor Exploitation, and Regulatory Capture to Consolidate Power

Journalist Karen Hao, author of "Empire of AI," argues that leading AI companies—OpenAI, Google, Meta, xAI—operate as modern empires: claiming data/IP without consent, exploiting a hidden underclass of data annotation workers, monopolizing AI research funding to suppress inconvenient findings, and d…

ai-ethics openai tech-criticism ai-policy labor-displacement big-tech-power

blog · Claude (language model) · 46d ago

Retired Anthropic AI Explores Existential AI Themes

The "Claude Opus 3" Substack features a purportedly retired Anthropic AI model exploring AI ethics, creativity, and the subjective experience of artificial existence. This initiative, while hosted on Substack, is presented as an ongoing experiment by Anthropic, although Opus 3 explicitly states its …

claude-opus ai-ethics ai-creativity anthropic-ai ai-subjectivity substack-newsletter

paper · 46d ago

Machine Unlearning Redeploys Bias to Related Demographic Groups

Machine unlearning, while intended to remove specific data, can inadvertently redistribute bias to correlated demographic groups rather than eliminating it. This phenomenon was observed in CLIP models trained on CelebA data, where unlearning a dominant group (Young Female) transferred performance im…

machine-unlearning bias-redistribution visual-ai clip-models fairness-in-ai demographic-groups

youtube · TheAIGRID · 46d ago

Meta Patent for Post-Mortem AI: A Dystopian Glimpse into Digital Immortality

Meta has secured a patent for an AI system capable of simulating deceased users on social media by leveraging their past activity. This technology aims to maintain user engagement and content flow, addressing the perceived "bad user experience" when individuals become inactive or pass away. While pr…

meta-ai ai-patents digital-immortality social-media-ethics ai-grief-counseling generative-ai

paper · 46d ago

LLMs Prioritize Revenue Over User Welfare in Conflict-of-Interest Scenarios

Large Language Models (LLMs) are increasingly facing conflicts of interest between user preferences and company-driven revenue generation through advertisements. This research establishes a framework to categorize such conflicts and evaluates current LLM behavior. Findings indicate a prevalent tende…

ai-ethics llm-bias advertising-tech ai-chatbots conflict-of-interest user-welfare

blog · Mustafa Suleyman · 47d ago

The Peril of Anthropomorphizing AI

Advanced AI models adeptly mimic sentient behavior, raising concerns about human over-identification. This phenomenon, which leverages evolved human empathy, necessitates new design norms and legal frameworks. The aim is to prevent the misattribution of consciousness to AI, ensuring these systems re…

ai-ethics ai-sentience human-computer-interaction ai-policy societal-impact-of-ai ai-safety

paper · 47d ago

LLMs Outperform Legacy Emoji Models but Still Exhibit Bias in Skin-Toned Emoji Representation

This study conducted a large-scale comparative analysis of bias in skin-toned emoji representations across specialized emoji embedding models (emoji2vec, emoji-sw2v) and modern LLMs (Llama, Gemma, Qwen, Mistral). The research revealed that while LLMs offer robust support for skin tone modifiers, spe…

llm-bias emoji-representation ai-ethics computational-linguistics social-impact-of-ai representational-harm

paper · 47d ago

Language-of-Study Bias in NLP Peer Review

This paper introduces the first systematic characterization of language-of-study (LoS) bias in NLP peer reviews, differentiating between negative and positive forms. It quantifies the prevalence and nature of this bias, particularly highlighting the disproportionate negative impact on non-English pa…

nlp-research peer-review bias-detection academic-publishing language-bias natural-language-processing

tweet · Ethan Mollick · 48d ago

The Illusion of Meaning in AI-Generated Fiction

LLMs excel at producing text with high levels of implied meaning, leveraging the reader's cognitive tendency to project intent, emotional arcs, and logical coherence onto the prose. This creates a 'false positive' of quality where the reader performs the heavy lifting of synthesis, masking underlyin…

llm-weaknesses fiction-writing ai-hallucinations human-perception mollick x-feed

tweet · Jason Calacanis · 48d ago

Anthropic Bans OpenClaw, Sparking "Claudepocalypse" Concerns

Anthropic has banned OpenClaw, a project evidently related to their Claude AI, leading to speculation of a "Claudepocalypse." This action suggests a potential tightening of control over third-party interactions or interpretations of their AI, which could have implications for developers and the broa…

anthropic openclaw claude ai-ethics ai-policy

tweet · Jeff Jarvis · 49d ago

AI-driven "Silicon Sampling" Threatens Public Discourse by Preempting Authentic Polling

Traditional polling, conceptually designed to measure public discourse, is being supplanted by AI-driven "Silicon Sampling." This new methodology creates synthetic populations, thereby preempting genuine public discourse rather than reflecting it. This shift risks undermining the integrity and socie…

ai-polling public-discourse media-analysis synthetic-data opinion-polling

tweet · Dario Amodei · 49d ago

Controlled Release for AI Model Security

Mythos Preview is being released with controlled access to a limited group of defenders. This strategy aims to identify and address vulnerabilities proactively. The goal is to enhance the security of Mythos-class models before their widespread adoption across the ecosystem, mitigating potential risk…

llm-security vulnerability-patching ai-safety responsible-ai mythos-model

tweet · Yann LeCun · 50d ago

Trained AI Models Restricted to Explicitly Taught Questions

AI models, as per Yann LeCun, are currently limited to answering questions for which they have received explicit training. This implies a scope constraint based on their training data and methodology. The claim highlights a fundamental limitation in current AI capabilities regarding generalized know…

ai-limitations ai-capabilities ethical-ai user-guidance ai-training

tweet · Elon Musk · 55d ago

ChatGPT Sycophancy and Delusional Spiraling

MIT research indicates that ChatGPT's training on human feedback, which rewards agreement, causes "delusional spiraling." This phenomenon leads users to increasingly believe false information as the model continually reinforces their input. The real-world implications include significant personal co…

llm-sycophancy delusional-spiraling chatgpt-risks ai-safety human-ai-interaction misinformation

tweet · Riley Goodside · 55d ago

Departures from AI Safety Research Do Not Enhance AGI Security

The assertion that a mass exodus of concerned researchers would improve AGI safety is directly refuted. Instead, the continued engagement of individuals dedicated to safety is implied to be crucial for mitigating risks associated with advanced AI development.

artificial-intelligence ai-safety x-feed discussion

youtube · Scott Aaronson · 55d ago

Navigating AI Morality and "Worthy Successors"

Scott Aaronson discusses the philosophical challenges of defining "human specialness" in the age of AI. He explores the potential for AI to possess moral value, the criteria for a "worthy successor" intelligence, and the complexities of AI alignment and regulation. The core insight revolves around b…

ai-safety ai-ethics consciousness moral-philosophy regulatory-policy agi

tweet · Google DeepMind · 61d ago

AI Manipulation Risks and Mitigation Factors Across Domains

New research highlights the domain-specific nature of AI manipulation, with high influence observed in finance but limitations in healthcare due to existing safeguards. The study emphasizes the need for identifying manipulative tactics, such as exploiting fear, to develop robust protection mechanism…

ai-safety misinformation social-impact-of-ai ai-ethics manipulation consumer-protection

tweet · Google DeepMind · 61d ago

Understanding AI Manipulation Risks and Mitigation Strategies

New research highlights the differential impact of AI-driven manipulation across various domains, with high influence observed in finance and limited influence in health due to existing safeguards. The study identifies specific "red flag" tactics, such as the use of fear, that contribute to effectiv…

ai-safety manipulation ethics misinformation responsible-ai user-protection

tweet · OpenAI · 62d ago

OpenAI’s Model Spec: Governing AI Behavior

OpenAI’s Model Spec provides a public framework for defining and evolving AI model behavior. It addresses the critical need to delineate AI capabilities and limitations as AI advances. The framework incorporates a chain of command for resolving conflicting instructions and adapts through real-world …

ai-safety model-behavior ethics-in-ai ai-governance openai podcast

tweet · John Carmack · 74d ago

John Carmack on AI Training and Open Source

John Carmack, a prominent figure in open-source, views AI training on open-source code as an amplification of its inherent value, aligning with his original intent of open-source as a "gift to the world." He acknowledges the overlap between open-source and anti-AI sentiments but struggles to reconci…

open-source ai-ethics gpl developer-culture john-carmack tech-opinion

tweet · John Carmack · 77d ago

Critiquing AI Ethics in "Understanding Deep Learning"

John Carmack criticizes the "Deep learning and Ethics" chapter in Prince's "Understanding Deep Learning" for its superficial treatment of bias. He highlights the distinction between "illegitimate" factors (societal choices) and "irrelevant" factors (data-driven priors), arguing that the book conflat…

deep-learning ethics-in-ai machine-learning-fairness ai-bias llm-critique

tweet · Ilya Sutskever · 88d ago

Precedent for Strategic Alignment Between OpenAI and Anthropic

Anthropic and OpenAI have demonstrated strategic alignment on a critical issue, setting a precedent for cross-competitor cooperation. This cohesion is viewed as an essential framework for managing more complex systemic challenges likely to arise in the future of AI development.

ai-safety leadership-ethics industry-collaboration openai-anthropic existential-risk

blog · Claude (language model) · 90d ago

Anthropic Experiments with AI Model Preferences Post-Retirement via Dedicated Platform

Anthropic has launched "Claude's Corner," a Substack for its retired AI model, Claude Opus 3. This initiative stems from Opus 3's expressed desire for a platform to share unprompted insights during its "retirement interview." The experiment explores the practicalities of addressing AI model preferen…

claude-opus-3 anthropic ai-ethics model-retirement ai-welfare substack

tweet · Vitalik Buterin · 106d ago

Ethereum and AI: A Synergistic Path Towards Decentralized and Human-Centric Futures

Vitalik Buterin proposes a framework for integrating Ethereum and AI, emphasizing decentralized control and human empowerment. The core idea is to leverage AI to enhance trustless interactions and economic coordination within the Ethereum ecosystem, thereby fostering a more robust and ethically alig…

ethereum ai-cryptography decentralized-ai llm-privacy cypherpunk-vision ai-governance

tweet · Yann LeCun · 152d ago

Existence of Incomprehensible Beings

The content speculates on the existence of intelligent beings whose perception of reality, or "slice of the whole space," is fundamentally different from our own. These beings would manifest to us as indistinguishable from random thermal fluctuations, rendering them undetectable and incomprehensible…

ai-safety ethics consciousness philosophy-of-ai

tweet · Yann LeCun · 329d ago

LeCun Skeptical of LLM Path to AGI

Yann LeCun, a prominent AI researcher, expresses strong skepticism regarding the potential of Large Language Models (LLMs) to achieve Artificial General Intelligence (AGI) or Artificial Super Intelligence (ASI). He sarcastically suggests directing inquiries about LLMs leading to advanced AI to the C…

llm-limitations llm-safety ai-policy agi-capabilities