A COMPARATIVE STUDY OF PRE-TRAINED CNN ARCHITECTURES FOR DETECTING AI-GENERATED VERSUS HUMAN-CREATED IMAGES

Authors

  • Ayat Abd-Muti Alrawahneh Universiti Kebangsaan Malaysia
  • Siti Norul Huda Sheikh Abdullah Universiti Kebangsaan Malaysia
  • Amelia Natasya Abdul Wahab Universiti Kebangsaan Malaysia
  • Sarah Khadijah Taylor CyberSecurity Malaysia
  • Nik Rafizal Nik Ab. Rahim HLA Integrated Sdn Bhd

Keywords:

AI-generated images; deepfake detection; convolutional neural networks; transfer learning; image forensics; model evaluation

Abstract

The widespread use of AI-generated imagery, enabled by advanced generative models, poses increasing challenges to digital content verification and authenticity. This study evaluates the performance of four widely adopted convolutional neural network (CNN) architectures—ResNet50, EfficientNetV2B0, InceptionV3, and VGG16—for classifying images as AI-generated or human-created. A balanced dataset of approximately 80,000 labeled images was used, and all models were trained using a consistent transfer learning pipeline with ImageNet pre-trained weights. Images were resized according to model-specific input dimensions and preprocessed using architecture-appropriate normalization methods. The dataset was split using an 80/10/10 ratio for training, validation, and testing, and each model was trained for eight epochs without data augmentation to focus on baseline performance.

The evaluation was conducted using training and validation accuracy and loss. ResNet50 achieved the highest validation accuracy (97.13%) and the lowest validation loss (0.0861), indicating strong generalization capability. EfficientNetV2B0 followed closely, while InceptionV3 and VGG16 performed slightly lower in both metrics. Visualization of training dynamics, including accuracy and loss curves, showed that all models converged effectively, with ResNet50 demonstrating the most stable and efficient learning trajectory. A final performance comparison chart further highlighted the superior performance of ResNet50 and EfficientNetV2B0. These findings underscore the effectiveness of pre-trained CNN architectures in distinguishing between synthetic and real visual content. The study also establishes a performance baseline for future work in AI-generated image detection, contributing to the broader field of multimedia forensics and trustworthy AI.

Downloads

Download data is not yet available.

Downloads

Published

26-06-2025

How to Cite

Alrawahneh, A. A.-M., Sheikh Abdullah, S. N. H. ., Abdul Wahab, A. N., Khadijah Taylor, S. ., & Rafizal Nik Ab. Rahim, N. (2025). A COMPARATIVE STUDY OF PRE-TRAINED CNN ARCHITECTURES FOR DETECTING AI-GENERATED VERSUS HUMAN-CREATED IMAGES. Malaysian Journal of Cybersecurity and Applications, 1(1), 50–67. Retrieved from https://jupidi.um.edu.my/index.php/mjca/article/view/60059

Most read articles by the same author(s)