Skip to main content
Loading...
Scholar9 logo True scholar network
  • Login/Sign up
  • Scholar9
    Publications ▼
    Article List Deposit Article
    Mentorship ▼
    Overview Sessions
    Q&A Institutions Scholars Journals
    Publications ▼
    Article List Deposit Article
    Mentorship ▼
    Overview Sessions
    Q&A Institutions Scholars Journals
  • Login/Sign up
  • Back to Top

    Transparent Peer Review By Scholar9

    ASD-Pipeline: An Ensemble Machine Learning Framework Integrating Feature Selection, Behavioural Clustering, and Class Rebalancing for Accurate Autism Spectrum Disorder Prediction

    Abstract

    Autism Spectrum Disorder (ASD) is a complex neurodevelopmental condition characterized by a variety of behavioral and cognitive patterns. Early and precise detection is critical in enabling timely interventions. Conventional classification models frequently exhibit poor generalization due to irrelevant features, unstructured behavioral data, and severe class imbalance. Despite current advances in machine learning for ASD detection, current models do not integrate adaptive feature selection, behavioral grouping, or imbalanced class handling in a unified, end-to-end pipeline. The lack of incorporation frequently results in suboptimal performance and limited interpretability. This study proposes a new ensemble-based framework called ASD-Pipeline, which integrates flexible feature selection, hybrid clustering, synthetic minority oversampling, and ensemble voting classification to improve the predictive performance for ASD identification. The proposed ASD-Pipeline framework uses a five-stage process to improve the accuracy of autism spectrum disorder prediction. First, the dataset is normalized utilizing Min-Max scaling to guarantee that the feature ranges remain consistent. Next, feature selection is performed utilizing FlexiFeat, an ensemble method integrating filter-based (CfsSubsetEval with BestFirst), wrapper-based (WrapperSubsetEval with GreedyStepwise), and embedded (ReliefF with Ranker) techniques to maintain only the most pertinent feature. The ClusterGroup stage uses K-Means clustering (k=5) and DBSCAN improvement (ε=0.5, minPts=3) within each cluster to create behavioral groups and remove outliers. The ReBalance stage uses Cluster-SMOTE to tackle class imbalance by producing synthetic samples for the minority class and a balanced dataset. Finally, the ASDClassifier stage involves training an ensemble of Logistic Regression, Support Vector Machine, and Gradient Boosting classifiers that are combined using soft voting. Metrics used to assess the model include accuracy, precision, recall, F1-score, and Matthews Correlation Coefficient (MCC). The proposed ASD-Pipeline surpassed existing models, achieving a significantly higher accuracy of 96.18% compared to previous techniques ranging from 76.80% to 90.60%. It also scored 91.51% precision, 91.63% recall, 95.57% F1-score, and 92.51% specificity. These findings emphasize the pipeline's efficacy in enhancing generalization and tackling difficulties such as feature relevance, behavioral grouping, and class imbalance for ASD prediction. The ASD-Pipeline offers a reliable, interpretable, and modular machine learning solution for ASD prediction. Its incorporated method tackles critical challenges in feature relevance, behavioral variability, and data imbalance, rendering it a promising tool for healthcare practitioners and researchers seeking data-driven insights into early ASD detection.

    User Profile
    User Profile
    User Profile
    User Profile
    User Profile

    Rajesh Tirupathi Reviewer

    badge Review Request Accepted

    Rajesh Tirupathi Reviewer

    badge Approved

    Relevance and Originality

    Methodology

    Validity & Reliability

    Clarity and Structure

    Results and Analysis

    Relevance and Originality

    This research tackles a pressing problem in the field of neurodevelopmental diagnostics by offering a unified pipeline for early ASD detection, something often missing in previous approaches. The originality lies in merging feature selection, behavioral grouping, and class balancing into one cohesive framework. By structuring these diverse elements into a modular design, the work advances the field beyond conventional, disjointed models and aligns closely with real-world clinical demands for interpretable and scalable solutions in autism prediction.

    Methodology

    The study employs a layered approach where each stage of the pipeline addresses a known weakness in ASD data processing. The normalization phase prepares the dataset efficiently, while the FlexiFeat component smartly merges multiple feature selection techniques, enhancing dimensional relevance. The hybrid clustering using K-Means and DBSCAN provides behavioral structuring, and the integration of Cluster-SMOTE at the class rebalancing stage directly confronts data skewness. Concluding with a soft-voting classifier ensemble ensures classification strength, making this method not only well-rounded but technically refined.

    Validity & Reliability

    The performance outcomes provide a solid foundation for the study’s claims. High scores across various metrics—including accuracy, recall, and MCC—suggest a model that is both precise and generalizable. The balanced evaluation methodology ensures each component of the model contributes effectively to the outcome. However, while internal validation appears thorough, future studies should explore how well this framework performs across diverse population datasets to fully confirm its reliability in broader clinical applications.

    Clarity and Structure

    The structure of the research is systematic and easy to follow, guiding the reader through the five pipeline phases in a logical manner. Terminologies are well explained, and the transitions between concepts maintain coherence. The description of each technical process is sufficiently detailed, without overwhelming the reader. Some aspects, such as the interaction between clustering and synthetic oversampling, might benefit from visual representation to enhance understanding of the methodology’s flow.

    Result Analysis

    The reported outcomes are comprehensive and directly support the utility of the proposed system. By presenting a comparative performance benchmark against existing techniques, the research highlights its innovation and superiority clearly. The interpretation of results is consistent with the design goals of the pipeline and showcases how thoughtful integration of machine learning elements can lead to substantial improvement in ASD classification.

    IJ Publication Publisher

    Respected Sir,

    Thank you for your detailed and thoughtful evaluation. We value your recognition of our modular framework’s novelty and methodological rigor, as well as your constructive suggestion to validate on diverse populations and improve visual explanations. These points are crucial for enhancing both the reliability and clarity of our ASD detection model.

    We appreciate your time and insights—thank you once again.

    Publisher

    User Profile

    IJ Publication

    Reviewers

    User Profile

    Rajesh Tirupathi

    User Profile

    Rajkumar Kyadasu

    User Profile

    Hrishikesh Rajesh Mane

    User Profile

    Rahul Arulkumaran

    User Profile

    PRONOY CHOPRA

    More Detail

    User Profile

    Paper Category

    Computer Engineering

    User Profile

    Journal Name

    IJNRD - INTERNATIONAL JOURNAL OF NOVEL RESEARCH AND DEVELOPMENT

    User Profile

    p-ISSN

    User Profile

    e-ISSN

    2456-4184

    Subscribe us to get updated

    logo logo

    Scholar9 is aiming to empower the research community around the world with the help of technology & innovation. Scholar9 provides the required platform to Scholar for visibility & credibility.

    QUICKLINKS

    • What is Scholar9?
    • About Us
    • Mission Vision
    • Contact Us
    • Privacy Policy
    • Terms of Use
    • Blogs
    • FAQ

    CONTACT US

    • +91 82003 85143
    • hello@scholar9.com
    • www.scholar9.com

    © 2026 Sequence Research & Development Pvt Ltd. All Rights Reserved.

    whatsapp