← All publications

Journal — A&A — 2026

RGC: a radio AGN classifier based on deep learning I. A semi-supervised multiclass model for VLA images

Mir Sazzat Hossain, Md. Shahadat Hossain Shahal, Khan Muhammad Bin Asad, Payaswini Saikia, Adrita Khan, Fatema Akter, Amin Ahsan Ali, Md Ashraful Amin, Deba Priyo Guha, M. O. B. Jihad, Arshad Momen, Siddhartho Sen, AKM Mahbubur Rahman

Accepted to Astronomy & Astrophysics.

Abstract

Bent radio active galactic nuclei (RAGNs)—wide-angle tails (WATs) and narrow-angle tails (NATs)—trace dense environments in galaxy groups and clusters, yet no multiclass classifier simultaneously separates them from straight Fanaroff—Riley types (sFRI, sFRII) using visually inspected labels and unlabelled data. We release RGC-D1, a four-class labelled dataset of 2060 RAGNs (sFRI, sFRII, WAT, NAT) constructed from three publicly available catalogues through multi-tier visual inspection, together with the semi-supervised RGC 1.0 model that leverages 20,000 unlabelled radio sources. We benchmark RGC 1.0 against five supervised baselines spanning CNNs and vision transformers. RGC-D1 is provided in two preprocessing variants: RL1\mathbf{R}_{L1}, which retains spurious sources, and RL2\mathbf{R}_{L2}, from which they are removed. The RGC model integrates the self-supervised framework BYOL (Bootstrap Your Own Latent) with an E(2)E(2)-equivariant steerable CNN (E2CNN) encoder, pre-trained on the unlabelled data and fine-tuned on the labelled sets. All six models are evaluated with 5-fold cross-validation, Grad-CAM attention analysis, and controlled class-imbalance experiments. ConvNeXT (M1M_1) and RGC (M2M_2) form a top tier at macro-F1F_1 0.80±0.020.80\pm0.02 and 0.79±0.020.79\pm0.02 respectively, a difference within one standard deviation. M2M_2 is the best-calibrated model (ECE 0.07\approx 0.07) and the only one whose Grad-CAM contours consistently trace the morphological structure of RAGNs—lobes, jets, and bends—rather than defaulting to compact blobs or diffuse patterns. The four-class scheme introduced here enables WAT/NAT-resolved catalogues that can serve as environment probes and progenitor classifications for diffuse cluster radio emission. The complementary strengths of M1M_1 and M2M_2—in cross-type and within-type discrimination respectively—suggest that an ensemble approach may offer a practical framework for survey-scale morphological catalogues.

Topics

Galaxies: active Radio continuum: galaxies Techniques: image processing Methods: data analysis Methods: statistical

Cite

@article{_Qo2XoVZTnwC,
  title     = {RGC: a radio AGN classifier based on deep learning I. A semi-supervised multiclass model for VLA images},
  author    = {Mir Sazzat Hossain and Md. Shahadat Hossain Shahal and Khan Muhammad Bin Asad and Payaswini Saikia and Adrita Khan and Fatema Akter and Amin Ahsan Ali and Md Ashraful Amin and Deba Priyo Guha and M. O. B. Jihad and Arshad Momen and Siddhartho Sen and AKM Mahbubur Rahman},
  journal = {Astronomy & Astrophysics},
  year      = {2026}
}