Academic

Mirror: A Multi-Agent System for AI-Assisted Ethics Review

Yifan Ding, Yuhui Shi, Zhiyan Li, Zilong Wang, Yifeng Gao, Yajun Yang, Mengjie Yang, Yixiu Liang, Xipeng Qiu, Xuanjing Huang, Xingjun Ma, Yu-Gang Jiang, Guoyu Wang · March 7, 2026 · 1 min read · 19 views

#cs.AI

arXiv:2602.13292v1 Announce Type: new Abstract: Ethics review is a foundational mechanism of modern research governance, yet contemporary systems face increasing strain as ethical risks arise as structural consequences of large-scale, interdisciplinary scientific practice. The demand for consistent and defensible decisions under heterogeneous risk profiles exposes limitations in institutional review capacity rather than in the legitimacy of ethics oversight. Recent advances in large language models (LLMs) offer new opportunities to support ethics review, but their direct application remains limited by insufficient ethical reasoning capability, weak integration with regulatory structures, and strict privacy constraints on authentic review materials. In this work, we introduce Mirror, an agentic framework for AI-assisted ethical review that integrates ethical reasoning, structured rule interpretation, and multi-agent deliberation within a unified architecture. At its core is EthicsLLM, a foundational model fine-tuned on EthicsQA, a specialized dataset of 41K question-chain-of-thought-answer triples distilled from authoritative ethics and regulatory corpora. EthicsLLM provides detailed normative and regulatory understanding, enabling Mirror to operate in two complementary modes. Mirror-ER (expedited Review) automates expedited review through an executable rule base that supports efficient and transparent compliance checks for minimal-risk studies. Mirror-CR (Committee Review) simulates full-board deliberation through coordinated interactions among expert agents, an ethics secretary agent, and a principal investigator agent, producing structured, committee-level assessments across ten ethical dimensions. Empirical evaluations demonstrate that Mirror significantly improves the quality, consistency, and professionalism of ethics assessments compared with strong generalist LLMs.

Executive Summary

The article introduces Mirror, a multi-agent system designed to enhance AI-assisted ethics review in research governance. It addresses the growing challenges faced by traditional ethics review systems, particularly in handling large-scale, interdisciplinary research with heterogeneous risk profiles. Mirror integrates ethical reasoning, structured rule interpretation, and multi-agent deliberation through a framework that includes EthicsLLM, a model fine-tuned on a specialized dataset of ethical and regulatory questions. The system operates in two modes: Mirror-ER for expedited review and Mirror-CR for simulating full-board deliberation. Empirical evaluations show that Mirror improves the quality, consistency, and professionalism of ethics assessments compared to generalist LLMs.

Key Points

▸ Mirror is a multi-agent system designed to support AI-assisted ethics review.
▸ It integrates ethical reasoning, structured rule interpretation, and multi-agent deliberation.
▸ Mirror operates in two modes: expedited review (Mirror-ER) and full-board deliberation (Mirror-CR).
▸ EthicsLLM, a foundational model fine-tuned on EthicsQA, provides normative and regulatory understanding.
▸ Empirical evaluations demonstrate significant improvements in ethics assessments.

Merits

Innovative Framework

Mirror introduces a novel approach to ethics review by integrating multiple agents and specialized models, addressing the limitations of traditional review systems.

Comprehensive Ethical Reasoning

The use of EthicsLLM, fine-tuned on a large dataset of ethical and regulatory questions, provides a robust foundation for ethical reasoning and decision-making.

Empirical Validation

The system's effectiveness is supported by empirical evaluations, demonstrating improvements in the quality, consistency, and professionalism of ethics assessments.

Demerits

Privacy Constraints

The strict privacy constraints on authentic review materials may limit the system's ability to handle sensitive or confidential information.

Integration Challenges

Integrating Mirror with existing regulatory structures and institutional review processes may pose significant challenges.

Ethical Reasoning Limitations

Despite advancements, the ethical reasoning capabilities of LLMs may still be insufficient for complex or nuanced ethical dilemmas.

Expert Commentary

The introduction of Mirror represents a significant advancement in the field of AI-assisted ethics review. By leveraging multi-agent systems and specialized models, the framework addresses critical limitations in traditional review processes, particularly in handling large-scale, interdisciplinary research. The empirical validation of Mirror's effectiveness is a strong indicator of its potential to enhance the quality and consistency of ethics assessments. However, challenges remain, particularly in integrating the system with existing regulatory structures and addressing privacy constraints. The successful implementation of Mirror will require careful consideration of these issues, as well as ongoing evaluation and refinement of the system's ethical reasoning capabilities. Policymakers and institutions must also be prepared to adapt their governance frameworks to accommodate the use of AI in ethics review, ensuring that the benefits of such systems are realized without compromising the integrity of the review process.

Recommendations

✓ Further research should focus on improving the ethical reasoning capabilities of LLMs to handle more complex and nuanced ethical dilemmas.
✓ Institutions should invest in the necessary training and infrastructure to integrate Mirror into their existing review systems, ensuring seamless and effective implementation.

Sources

arXiv - cs.AI

Mirror: A Multi-Agent System for AI-Assisted Ethics Review

AI Commentary

Executive Summary

Key Points

Merits

Innovative Framework

Comprehensive Ethical Reasoning

Empirical Validation

Demerits

Privacy Constraints

Integration Challenges

Ethical Reasoning Limitations

Expert Commentary

Recommendations

Sources

Related Articles

ConstitutionGPT: An AI-Powered Multilingual Legal Assistance System for Indian Citizens

AI Copyright Infringement: Navigating the Legal Risks of AI-Generated Content

The Rhetoric of Machine Learning

Busemann energy-based attention for emotion analysis in Poincar\'e discs

JCG, PC

HSOLLC Co., Ltd.

Related Articles

ConstitutionGPT: An AI-Powered Multilingual Legal Assistance System for Indian Citizens

AI Copyright Infringement: Navigating the Legal Risks of AI-Generated Content

The Rhetoric of Machine Learning

Busemann energy-based attention for emotion analysis in Poincar\'e discs