ISSN: 2459-1777 | E-ISSN 2587-0394
Volume : 10 Issue : 3 Year : 2025
pdf

Comparative Detector Analysis for the Identification of Academic Articles Synthesized by Artificial Intelligence in the Field of Ophthalmology [Beyoglu Eye J]
Beyoglu Eye J. 2025; 10(3): 175-180 | DOI: 10.14744/bej.2025.70973

Comparative Detector Analysis for the Identification of Academic Articles Synthesized by Artificial Intelligence in the Field of Ophthalmology

Sebnem Kaya Ergen
Department of Ophthalmology, Kocaeli State Hospital, Kocaeli, Türkiye

OBJECTIVES: The increasing use of large language models, such as ChatGPT, in academic writing has raised significant ethical concerns within the academic community. This study explores the potential challenges posed by the ability of artificial intelligence (AI) to produce realistic, evidence-based academic texts and investigates whether these challenges can be effectively controlled.
METHODS: Three original articles in the field of ophthalmology were provided as input to ChatGPT-4o to generate introduction sections. A total of 50 introduction texts were synthesized from 150 original articles. These AI-generated texts were analyzed using AI detectors (GPTZero, Writer, CorrectorApp, and ZeroGPT) and a plagiarism detector. In addition, the ability of AI detectors to differentiate between original and AI-generated texts was evaluated.
RESULTS: There was a statistically significant difference in AI detector probabilities between original and AI-generated texts (p<0.001 for all detectors). GPTZero demonstrated a sensitivity of 100% and a specificity of 96% in distinguishing original from AI-generated texts, outperforming all other AI detectors. However, paraphrased AI-generated texts significantly reduced the detection accuracy of GPTZero (p<0.001).
DISCUSSION AND CONCLUSION: ChatGPT-4o demonstrated the ability to synthesize new texts with referenced citations within seconds, capable of bypassing plagiarism detectors. However, AI detectors showed limitations in achieving absolute accuracy and occasionally misclassified original texts. Even with the most accurate AI detectors, a simple paraphrasing method significantly compromised prediction accuracy, highlighting the need for improved detection strategies and ethical oversight.

Keywords: Artificial intelligence detection, Artificial intelligence ethic, ChatGPT-4o, large language models, research policy

Corresponding Author: Sebnem Kaya Ergen, Türkiye
Manuscript Language: English
×
APA
NLM
AMA
MLA
Chicago
Copied!
CITE
LookUs & Online Makale