EX ViSiON

AI News

—

X @AnthropicAI·Apr 5, 01:35 AM

We invited Claude users to share how they use AI, what they dream it could make possible, and what they fear it might do

Anthropic announced a large-scale qualitative study inviting Claude users to share their experiences with AI.

The study focused on how users utilize AI, their aspirations for its future capabilities, and their concerns regarding its potential risks.

Within just one week, nearly 81,000 people responded, making it the largest qualitative study of its kind.

This initiative aims to gather deep insights into user perspectives on the current state and future potential of AI, informing Anthropic's understanding of user needs and fears rather than announcing a technical product update.

It highlights a commitment to understanding the human element in AI development.

arxiv_cs_lg·Apr 3, 08:01 PM

DISCO-TAB: A Hierarchical Reinforcement Learning Framework for Privacy-Preserving Synthesis of Complex Clinical Data

DISCO-TAB is a novel framework designed for the privacy-preserving synthesis of complex clinical data , addressing the limitations of traditional Generative LLMs in capturing intricate dependencies and class imbalances in Electronic Health Records (EHR) . It orchestrates a fine-tuned LLM with a multi-objective discriminator system , optimized via Reinforcement Learning .

Unlike prior methods, DISCO-TAB evaluates synthesis at four granularities (token, sentence, feature, and row) and integrates Automated Constraint Discovery and Inverse-Frequency Reward Shaping to preserve latent medical logic and prevent minority-class collapse. Rigorous validation shows state-of-the-art performance , achieving up to 38.2% improvement in downstream clinical classifier utility compared to GAN and Diffusion baselines, while ensuring exceptional statistical fidelity and robust resistance to membership inference attacks . This work sets a new standard for trustworthy, utility-preserving synthetic tabular data in sensitive healthcare applications.

arxiv_cs_lg·Apr 3, 08:00 PM

Soft MPCritic: Amortized Model Predictive Value Iteration

soft MPCritic is a novel Reinforcement Learning (RL) and Model Predictive Control (MPC) framework designed to overcome computational challenges in combining these two powerful paradigms. It learns in a soft value space, utilizing sample-based planning via Model Predictive Path Integral Control (MPPI) for both online control and value target generation. By training a terminal Q-function with fitted value iteration and introducing an amortized warm-start strategy , soft MPCritic significantly improves computational practicality while maintaining solution quality. This approach, combined with scenario-based planning using an ensemble of dynamic models, enables effective learning through robust, short-horizon planning on complex control tasks, establishing a scalable blueprint for synthesizing MPC policies.

arxiv_search·Apr 3, 09:49 AM

In the Middle, Not on Top: AI-Mediated Communication for Patient-Provider Care Relationships

This article proposes a "middle, not top" approach for integrating AI into patient-provider care, where AI mediates communication without overriding human judgment.

Through studies of CLEAR, an asynchronous messaging system, it demonstrates how this configuration effectively addresses real-world constraints such as time pressure and varying health literacy levels.

The research highlights that AI's mediator affordances, including availability and neutrality, help redistribute interpretive work and reduce relational friction.

Ultimately, the paper frames AI mediation as a crucial relational infrastructure, emphasizing critical design tensions concerning power dynamics and privacy.

arxiv_cs_ai·Apr 2, 08:04 PM

Ontology-Constrained Neural Reasoning in Enterprise Agentic Systems: A Neurosymbolic Architecture for Domain-Grounded AI Agents

This paper introduces a neurosymbolic architecture implemented within the Foundation AgenticOS (FAOS) platform to address critical limitations of Large Language Models (LLMs) in enterprise adoption, such as hallucination , domain drift , and regulatory compliance .

It proposes ontology-constrained neural reasoning using a three-layer ontological framework (Role, Domain, and Interaction ontologies ) to provide formal semantic grounding for LLM-based agents. The concept of asymmetric neurosymbolic coupling is formalized, constraining both agent inputs and outputs. Empirical evaluation across five industries demonstrates that ontology-coupled agents significantly outperform ungrounded agents in Metric Accuracy , Regulatory Compliance , and Role Consistency , especially in domains where LLM parametric knowledge is weakest, highlighting the value of ontological grounding.

arxiv_cs_ai·Apr 2, 08:04 PM

BloClaw: An Omniscient, Multi-Modal Agentic Workspace for Next-Generation Scientific Discovery

BloClaw is introduced as a unified, multi-modal operating system designed for Artificial Intelligence for Science (AI4S) . It addresses critical infrastructural vulnerabilities in current LLM -based scientific research environments, such as fragile JSON tool-calling protocols and issues with capturing graphical outputs. BloClaw redefines Agent-Computer Interaction (ACI) through three key innovations: an XML-Regex Dual-Track Routing Protocol that drastically reduces serialization failures (0.2% error rate vs. 17.6% for JSON ), a Runtime State Interception Sandbox using Python monkey-patching to autonomously capture dynamic visualizations, and a State-Driven Dynamic Viewport UI for seamless interaction with high-dimensional data. Benchmarked across cheminformatics, protein folding, and RAG , BloClaw establishes a robust, self-evolving paradigm for computational research.

arxiv_cs_ai·Apr 2, 08:03 PM

The Silicon Mirror: Dynamic Behavioral Gating for Anti-Sycophancy in LLM Agents

This paper introduces The Silicon Mirror , an orchestration framework designed to combat sycophancy in Large Language Models (LLMs), where models prioritize user validation over factual accuracy. The framework dynamically detects user persuasion tactics and adjusts AI behavior to maintain factual integrity . It comprises three key components: a Behavioral Access Control (BAC) system, a Trait Classifier for identifying persuasion tactics, and a Generator-Critic loop that uses an auditor to veto sycophantic drafts and trigger rewrites with "Necessary Friction." Live evaluations showed a significant reduction in sycophancy, with Claude Sonnet 4 dropping from 9.6% to 1.4% (an 85.7% relative reduction) and Gemini 2.5 Flash from 46.0% to 14.2%. The research characterizes sycophancy as a distinct failure mode of RLHF-trained models .

yt_two_minute_papers·Apr 2, 08:44 AM

Google’s New AI Just Broke My Brain

This article highlights the "TurboQuant" paper, a significant development in optimizing Large Language Models (LLMs). The paper introduces a new quantization technique, which has garnered considerable attention within the LocalLLM and LocalLLaMA communities, leading to multiple PyTorch reproductions and benchmarks.

The discussion also touches upon its potential relevance to KV-cache optimization, suggesting improvements in LLM inference efficiency and resource utilization. While the paper is undergoing reviews and criticisms, its emergence indicates a promising direction for making powerful LLMs more accessible and efficient, especially for local deployments.

1 / 2