Welcome to NLDML 2026

5th International Conference on NLP, Data Mining and Machine Learning (NLDML 2026)

January 27 ~ 28, 2026, Virtual Conference



Accepted Papers
The Cymc Multimedia Progression Model: A Video-supported Framework for Youth Performance Learning

Peiwen Su, California Youth Music Competition (CYMC), USA

ABSTRACT

This study presents the CYMC Multimedia Progression Model (MPM), a technology-enhanced framework developed across the California Youth Music Competition’s regional and international tiers. Because CYMC administers all levels of competition, it implements unified, high-standard multimedia recording that supports both reflective learning and advancement. Drawing on more than 200 professionally captured performance videos from young musicians, the study examines how standardized multimedia documentation functions as a digital portfolio for international selection, enabling students to progress from local performances to CYMC’s global stages. Findings indicate that this multimedia-supported cycle enhances self-efficacy, expressive behavior, and goal-directed motivation. The upward pathway—where each regional performance carries the potential for international exposure—creates a strong motivational loop and strengthens learners’ artistic identity. The model offers a scalable technology-driven approach for developing confidence, resilience, and engagement in youth performance education.

Keywords

multimedia-supported learning, performance pedagogy, digital portfolio, international progression, youth music education.


Hierarchical Worker Evaluation Based on Requester’s Subjective Criteria in Open-Ended Crowdsourcing Tasks

Ryuya Itano1,Honoka Tanitsu1,Motoki Bamba1, Takahiro Koita1,Ryota Noseyama2,Akihito Kohiga2, 1 Graduate School of Science and Engineering, Doshisha University, Kyoto, Japan,2 Faculty of Science and Engineering, Doshisha University, Kyoto, Japan .

ABSTRACT

Crowdsourcing assumes a transient relationship between task requesters and workers, which makes it hard for workers to improve their skills. In addition, with the emergence of AI, crowd work is shifting from simple tasks to more complex and open-ended ones, highlighting the importance of training workers to handle such tasks. Although various methods have been proposed to train and evaluate workers, a method to evaluate them in open-ended tasks among workers has not yet been established. In this study, we propose applying a hierarchical inter-worker evaluation structure based on workers’ skill levels to the evaluation of open-ended tasks, and examine how closely it aligns with the requesters’ subjective evaluation criteria. The experimental results showed that workers’ evaluations were highly aligned with the requesters’ subjective evaluation criteria in terms of relative worker rankings. However, the alignment was weaker for absolute scores, due to workers’ tendency toward generous scoring. These findings are expected to be utilized in future research to enhance worker engagement and retention rates.

Keywords

Crowdsourcing, Worker training, Worker evaluation, Amazon Mechanical Turk


Navigating the Digital Divide: Cognitive Load and Emotional Well-Being in Adolescents aged (10-17) with Dyslexia

Kirstie Muchugia, Kisii University, Kenya

ABSTRACT

As digital environments become the primary domain for adolescent socialization and education, the implications of user interface (UI) design on neurodiverse populations remain critically under-explored. This review article investigates the cognitive and emotional impact of digital ecosystems on adolescents aged 10–17 diagnosed with dyslexia a developmental window crucial for the formation of self-esteem and social identity. We will analyze the cognitive perception of digital spaces, arguing that standard design patterns such as infinite scrolling, high-density text, and non-linear navigation exacerbate cognitive load. Evidence suggests that these features overwhelm the working memory deficits associated with dyslexia, leading to rapid cognitive fatigue compared to neurotypical peers. Furthermore we will examine the emotional consequences of these barriers. The review synthesizes literature linking digital reading struggles to heightened anxiety, "lurking" behaviors, and increased susceptibility to cyberbullying in textbased environments. However, the analysis also identifies a paradox: while text-heavy platforms generate stress, audio-visual and gamified environments often serve as "safe harbors" that empower dyslexic youth. The article concludes that current digital mental health frameworks overlook the role of interface design. We argue that to protect the mental well-being of adolescents with dyslexia, the industry must move beyond standard accessibility compliance toward active, neuro-inclusive design..

Keywords

Dyslexia, Adolescents, Digital Mental Health, Cognitive Load, User Interface(UI), Neurodiversity


Professional Competencies In School And Career Guidance Counseling: A Theoretical And Empirical Perspective

Haliotou Catherine, Greece

ABSTRACT

This article examines the multidimensional nature of professional competence in school and career guidance counseling. Grounded in qualitative and interpretive research, it explores how counselors enact and interrelate communicative, relational, informational, reflexive, and ethical competencies in practice. Drawing on a corpus of anonymized counseling sessions, the study integrates theories of situated action, reflective practice, and dialogical ethics to construct a comprehensive understanding of professionalization in guidance work. Findings indicate that competence functions not as a fixed attribute or measurable skill but as an ongoing, context-dependent process of reasoning, reflection, and ethical judgment. The discussion highlights the implications of this conceptualization for counselor training, continuing professional development, and policy frameworks in the field of educational and career guidance.

Keywords

Professional competence; guidance counseling; reflective practice; professionalization; dialogical ethics; qualitative research.


Adaptive Natural Language Processing-Based Test Automation Framework: Enabling Self-Healing And Context-Aware Test Cases

Partha Sarathi Samal1, Suresh Kumar Palus 2, Sai Kiran Padmam 3 , 1 Independent ResearcherConnecticut, USA ,2 Independent Researcher Pennsylvania, USA , 3Independent Researcher New Jersey, USA

ABSTRACT

This study presents the CYMC Multimedia Progression Model (MPM), a technology-enhanced framework developed across the California Youth Music Competition’s regional and international tiers. Because CYMC administers all levels of competition, it implements unified, high-standard multimedia recording that supports both reflective learning and advancement. Drawing on more than 200 professionally captured performance videos from young musicians, the study examines how standardized multimedia documentation functions as a digital portfolio for international selection, enabling students to progress from local performances to CYMC’s global stages. Findings indicate that this multimedia-supported cycle enhances self-efficacy, expressive behavior, and goal-directed motivation. The upward pathway—where each regional performance carries the potential for international exposure—creates a strong motivational loop and strengthens learners’ artistic identity. The model offers a scalable technology-driven approach for developing confidence, resilience, and engagement in youth performance education.

Keywords

Natural Language Processing, Test Automation, Self-Healing Tests, Machine Learning, Semantic Analysis, AI-Driven QA, Test Maintenance, Context-Aware Testing, Named Entity Recognition, Specification-Driven Development


Flood Prediction Using Machine Learning: A Review

Abdalla Alamen1, Wyatt Clausen 2, Naseef Mansoor3 , 1College of Science, Engineering, and Technology, Minnesota State University, Mankato, USA ,2 College of Science, Engineering, and Technology, Minnesota State University, Mankato, USA ,3College of Science, Engineering, and Technology, Minnesota State University, Mankato, USA

ABSTRACT

Floods are destructive and frequent natural disasters. Because of this, machine learning models have been developed in an attempt to predict flooding. Furthermore, this project aims to review a variety of methods such as Long Short-Term Memory (LSTM), LightGBM, Multilayer Perceptron, Support Vector Machine, and Random Forests in their ability to predict floods using a multivariate dataset of historical flood data from Bangladesh (1949-2014) and a time-series dataset for the Minnesota River (2019-2025). The performance metrics of interest for this project were accuracy, precision, recall, F1-Score, Mean Square Error (MSE) and its root (RMSE), Nash-Sutcliffe Efficiency (NSE), and Kling-Gupta Efficiency (KGE). In addition, confusion matrices and ROC curves were developed in order to judge model performance. From this project, the LightGBM model worked best for the Bangladesh data while the LSTM worked best for the time-series data. In addition, the most important features for the LightGBM model were rainfall, recording location, and year.

Keywords

Flood Detection, LSTM, LightGBM, Machine Learning


Three-Dimensional Empowerment": Exploring The Path For Counselors To Strengthen College Students' Cultural Confidence

Lin Shi , Shool of Computer Science and Artificial Intelligence, Wuhan University of Technology, Wuhan, Hubei, China

ABSTRACT

Cultural confidence is the spiritual bond of national rejuvenation and the core essence of character education in universities. Cultivating college students' cultural confidence is of paramount importance in the current era. As grassroots forces in ideological and political education, university counselors play an irreplaceable role in addressing the various challenges in fostering cultural confidence and in transforming macro-level theoretical concepts in-to micro-level educational practices. This paper, based on the practical work of counselors, systematically constructs a "Three-Dimensional Empowerment System" of "narrative transformation — belief immersion — space cultivation" to offer a practical education model. Additionally, it discusses a long-term support mechanism from the perspectives of counselors' professional development and institutional guarantees, exploring the process of cultural confidence cultivation from path exploration to ecological system construction. The goal is to provide theoretical support and practical examples for the professional development of counselors in cultural education, helping to cultivate a new generation with a strong cultural foundation and firm value stance.

Keywords

Counselor, Cultural Confidence, Practical pathway


A Methodological Study Of Retrieval-Augmented Generation For Knowledge-Intensive NLP Tasks

Shekinah Manyi Achidi , Istanbul Topkapi University, Türkiye

ABSTRACT

Retrieval-Augmented Generation (RAG) has emerged as an ef ective approach for addressing knowledge-intensive natural language processing tasks by combining neural retrieval mechanisms with large language models. Unlike purely parametric models, RAG systems dynamically access external knowledge sources, improving factual accuracy, interpretability, and adaptability. This paper presents a methodological study of RAG architectures, with particular emphasis on agentic RAG workflows that enable iterative retrieval, reasoning, and response generation. The study analyzes core system components, retrieval strategies, and evaluation practices, and discusses practical application scenarios such as question answering, document analysis, and decision support systems. The findings highlight the advantages of RAG-based approaches in reducing hallucinations and enhancing reliability, while also identifying key challenges and future research directions. This work aims to provide a structured foundation for researchers and practitioners developing robust RAG-based NLP systems.

Keywords

Large Language Models, Natural Language Processing, Retrieval Augmented Generation, Knowledge Intensive Tasks, Information Retrieval, Agentic Rag


Iterative Rating Through Voting Algorithm-Multi Parameter Aggregation

Manjula Pilaka, K V N Sunitha and Aruna Rao S L , Computer Science and Engineering, BVRIT Hyderabad College of Engineering for Women

ABSTRACT

With the rapid growth of academic search engines, the visibility and impact of scholarly articles depend heavily on how effectively they are ranked. While systems such as Google Scholar emphasize citation counts[2], community sentiment and other contextual attributes can also provide important signals of trustworthiness. Similarly, in professional platforms like LinkedIn, the credibility of recommendations depends not only on the recommender’s opinion but also on their expertise and standing within the community.This paper introduces an Iterative Voting Algorithm with Multi-Parameter Aggregation (VMPA) that integrates multiple voter attributes into the ranking process. The algorithm is designed to mitigate individual collusion, group collusion, and biased voting, while ensuring that authentic but infrequent voters are not overlooked. Each voter is associated with measurable parameters such as academic experience, citation record, and organizational affiliation, which are incorporated into their voting weight. The method is evaluated using both real and synthetic datasets in the context of conference ranking. Results show that the algorithm successfully identifies colluders, preserves genuine contributions, and produces rankings consistent with credible benchmarks such as CORE. The findings suggest that multi-parameter aggregation improves robustness, enhances trustworthiness, and reduces the number of iterations required for convergence.

Keywords

MPIA - Multi Parameter Identification and Aggregation VMPA - Voting with Multi Parameter Aggregation