ArXiv to Enforce One-Year Ban on Authors Using AI to Fully Generate Work

ArXiv, a popular open repository for preprint research, is intensifying its efforts to address the careless use of large language models (LLMs) in scientific papers. Although papers posted there have not yet been peer reviewed, arXiv has become a crucial channel for research in fields like computer science and math, and it serves as a data source for tracking trends in scientific research.

To tackle the influx of low-quality, AI-generated papers, arXiv requires first-time posters to secure endorsements from established authors. The organization is transitioning into an independent nonprofit after over two decades with Cornell, which will enable it to raise funds to combat such issues.

Recently, Thomas Dietterich, chair of arXiv’s computer science section, stated that a submission containing undeniable evidence that its authors did not check LLM-generated results cannot be trusted. Such evidence may include “hallucinated references” and comments involving LLMs left in the text. Authors whose papers contain such evidence face a one-year ban, and their subsequent submissions must first be accepted by a reputable peer-reviewed venue.

This policy doesn’t ban LLM use outright, but Dietterich emphasized that authors must take full responsibility for their content. If authors copy inappropriate, plagiarized, biased, or incorrect content from an LLM, they are accountable for it. Dietterich described this as a “one-strike” rule: moderators must flag the issue and section chairs must verify the evidence before any penalty is imposed. Authors can appeal decisions.

Peer-reviewed research indicates a rise in fabricated citations in biomedical research, often due to LLMs, suggesting that the problem extends beyond arXiv.
