Your search
Results 216 resources
-
The increasing volume of suspicious emails, commonly known as spam, has created a critical need for more reliable and robust anti-spam filters. These suspicious emails can be dangerous and can lead to the loss of personal information, underscoring the necessity for an effective spam filtering system. The application of machine learning methods has enhanced system security and improved the detection of suspicious messages. This research evaluates the effectiveness of seven machine learning algorithms for classifying suspicious email messages: random forest, support vector machine, artificial neural network, decision tree, gradient boosting classifier, and k-nearest neighbor. The primary focus of this evaluation is the accuracy achieved by each algorithm in identifying spam emails. Our analysis revealed that the random forest algorithm outperformed the other evaluated algorithms in terms of accuracy for spam email classification, achieving a remarkable 95%. The accuracy percentages of the various methods ranged from 88% to 93%. Copyright 2025. The Korean Institute of Information Scientists and Engineers.
-
Traditional physical biometrics - such as fingerprints, facial recognition, and iris scans - have long been utilized for user identification in areas like border control, military operations, law enforcement, and public safety. However, the rise of smartphone technology has introduced new avenues for security research. One emerging area is behavioral biometrics, particularly the use of touchscreen interaction data as a more accessible and user-friendly identification method. In this context, our study focuses on a novel form of touchscreen input: capacitive swipe gestures for user identification. We compiled a comprehensive dataset of capacitive swipe gestures collected over multiple sessions from 30 participants. To evaluate this modality, we conducted thorough experiments using established machine learning algorithms, including Support Vector Machine, Random Forest, and XGBoost. Additionally, we developed a new preprocessing algorithm tailored for capacitive swipe data. Our findings reveal that this algorithm significantly enhances identification performance compared to existing methods. Overall, our results highlight the strong potential of capacitive swipe gestures as a viable biometric modality for user identification. © 2025 IEEE.
-
Tomato plant diseases pose a significant threat to agricultural productivity, resulting in substantial economic losses. Early and accurate diagnosis is crucial for effective disease management. This paper describes the design and implementation of expert systems for tomato disease detection using the CLIPS (C Language Integrated Production System) platform. The tool is designed to help farmers and agronomists accurately identify diseases affecting tomato crops by simulating knowledge from professional experts. We carefully developed a set of rules to distinguish leaf blight symptoms from those of other tomato diseases and provided recommendations to minimize crop losses and maximize yields. The expert system was developed using a forward-chaining inference engine, and its performance was evaluated through a set of real-world test cases, demonstrating a high level of accuracy and consistency in decision-making. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
-
With the increasing interest in natural language processing, text summarization has become essential for condensing large volumes of data into concise and meaningful summaries. Extractive summarization, which involves selecting key sentences based on textual features, has gained attention due to its efficiency and effectiveness. This research explores extractive summarization using multiple machine learning classifiers, including Support Vector Machines (SVM), Logistic Regression (LR), Decision Trees (DT), K-Nearest Neighbors (KNN), and Random Forest (RF). Our findings indicate that the Random Forest model achieved the highest accuracy, reaching 80% in classifying sentences for summary generation. Additionally, we evaluated text classification on the same BBC dataset using ChatGPT, which attained an accuracy of 62%. Furthermore, comparisons with results from prior research confirm the competitive performance of our approach, reinforcing the potential of machine learning models in extractive summarization. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
-
The rapid growth of technology has brought about many advantages, but has also made networks more susceptible to security threats. Intrusion Detection Systems (IDS) play a vital role in protecting computer networks against malicious activities. Given the dynamic and constantly evolving nature of cyber threats, these systems must continuously adapt to maintain their effectiveness. Machine Learning (ML) methods have gained prominence as effective tools for constructing IDS that offer both high accuracy and efficiency. This study conducts a performance assessment of several machine learning classifiers, including Random Forests (RF), Decision Trees (DT), and Support Vector Machines (SVM), in addressing multiclass intrusion detection as a means to counter cybersecurity threats. The NSL-KDD dataset, which includes various network attacks, served as the basis for our experimental evaluation. The research explores two classification scenarios: a five-class and a three-class model, analyzing their impact on detection performance. The results demonstrate that RF consistently achieves the highest accuracy (85.42%) on the three-class scenario testing set, highlighting its effectiveness in handling patterns and non-linear relationships within the intrusion data. Furthermore, reducing the classification complexity (three classes vs. five classes) significantly improves model generalization, as evidenced by the reduced performance gap between training and testing data. Friedman’s rank test and Holm’s post-hoc analysis were applied to ensure statistical rigor, confirming that RF outperforms DT and SVM in all evaluation metrics. These findings establish RF as the most robust classifier for intrusion detection and underscore the importance of simplifying classification tasks for improved IDS performance. © (2025), (Science Publications). All rights reserved.
-
Coronary heart disease (CHD) is the leading global cause of death, making early detection essential. While coronary angiography is the diagnostic gold standard, its invasive nature poses risks, and non-invasive symptom-based methods often lack accuracy. Machine learning-powered computer-aided diagnostic systems can effectively address challenges in clinical decisionmaking. This work presents an Evolutionary Strategy-optimized Support Vector Machine (ES-SVM) model for classifying CHD based on non-invasive test results and patient characteristics. Using the Coronary Heart Disease dataset, the proposed ESSVM demonstrated significant precision and F1-scores, as well as the accuracy of the proposed model. The results indicate that SVM performance can be significantly enhanced through evolutionary hyperparameter tuning, resulting in a reliable, noninvasive diagnostic tool for initial CAD screening and supporting early intervention techniques. © 2025 IEEE.
-
Modeling lipase activity aids researchers in optimizing features such as temperature, pH, and substrate concentration to boost enzyme performance. This is essential in biotechnology for progressing the productivity and yield of processes such as fermentation, biodiesel production, and bioremediation. Fermentation is a highly complex, multivariable, and non-linear biotechnological process that produces bioactive materials. This study leverages artificial neural networks (ANN) to predict lipase activity in batch fermentation processes, addressing the inherent challenges in weight learning optimization often encountered with traditional algorithms like Backpropagation (BP). Several metaheuristic algorithms were employed to optimize the Multilayered Perceptron (MLP) structure and weights, including moth-frequency optimization (MFO), Particle Swarm Optimization (PSO), Dandelion Optimizer Algorithm (DO), Crow Search Algorithm (CSA), and Salp Swarm Algorithm (SSA) to overcome these limitations. Among the tested algorithms, MFO emerged as the most effective approach, achieving superior performance in weight learning with the best fitness value (i.e., mean square error (MSE)) of 0.6006. MFO-optimized ANN models deliver the most accurate predictions for lipase activity, highlighting their potential as a powerful tool for advancing industrial fermentation process optimization. © 2025 IEEE.
-
Traditional brain tumor diagnosis and classification are time-consuming and heavily reliant on radiologist expertise. The ever-growing patient population generates vast data, rendering existing methods expensive and inefficient. Deep Learning (DL) is a promising approach for developing automated systems to diagnose or segment brain tumors with high accuracy in less time. Within Deep Learning, Convolutional Neural Networks (CNNs) are potent tools for image classification tasks. This is achieved through a series of specialized layers, including convolution layers that identify patterns within images, pooling layers that summarize these patterns, fully connected layers that ultimately classify the image, and a feedforward layer to produce the output class. This study employed a CNN to classify brain tumors in T1-weighted contrast-enhanced images with various image resolutions, including 30×30, 50×50, 70×70, 100×100, and 150×150 pixels. The model successfully distinguished between three tumor types: glioma, meningioma, and pituitary. The CNN's impressive accuracy on training data reached up to 86.38% for image resolution (30×30) and 94.64% for higher resolution (150×150). This indicates its potential as a valuable tool in real-world brain tumor classification tasks. © 2025 IEEE.
-
Metaheuristic methods have demonstrated their utility in tackling global optimization problems with and without constraints. However, existing...
-
Photovoltaic systems have proven to be one of the most widely used renewable energies and the best replacement for conventional energy. Yet, their non-linear nature remains a challenge when it comes to extracting maximum power from photovoltaic modules. Therefore, in this work, a nonlinear PID controller has been used to meet the requirements of the photovoltaic system. In addition, to improve system performance and response, metaheuristic search algorithms were introduced into the tuning process of both the NPID controller and conventional PID controller parameters in order to compare them. The use of Artificial Intelligence to fine-tune the controller parameters will enable the optimum values of proportional, integral, derivative and nonlinear gains to be determined as system condition change. Finally, a comparison between the algorithms applied is conducted in terms of efficiency, rise time, settling time and overshoot as well as the overall system stability.
-
Diabetes, affecting more than 500 million individuals worldwide, is the most widespread non-communicable disease, globally. The early identification and effective management of diabetes are crucial for controlling its spread. Currently, the HbA1c test is the gold standard for the detection of diabetes with high confidence. But this is an invasive, expensive pathology test. Therefore, alternative non-invasive and inexpensive methods have been proposed in the literature for the early detection of diabetes.
-
Obstructive Sleep Apnea (OSA) is a prevalent health issue affecting 10-25% of adults in the United States (US) and is associated with significant economic consequences. Machine learning methods have shown promise in improving the efficiency and accessibility of OSA diagnoses, thus reducing the need for expensive and challenging tests. A comparative analysis of Logistic Regression (LR), Support Vector Machine (SVM), Gradient Boosting (GB), Gaussian Naive Bayes (GNB), Random Forest (RF), and K-Nearest Neighbors (KNN) algorithms was conducted to predict Obstructive Sleep Apnea (OSA). To improve the predictive accuracy of these models, Random Oversampling was applied to address the imbalance in the dataset, ensuring a more equitable representation of the minority class. Patient demographics, including age, sex, height, weight, BMI, neck circumference, and gender, were employed as predictive features in the models. The RFC provided outstanding training and testing accuracies of 87% and 65%, respectively, and a Receiver Operating Characteristic (ROC) score of 87%. The GBC and SVM classifiers also demonstrated good performance on the test dataset. The results of this study show that machine learning techniques may be effectively used to diagnose OSA, with the Random Forest Classifier demonstrating the best results.
-
Background: In the United States, chronic obstructive pulmonary disease (COPD) is a significant cause of mortality. As far as we know, it is a chronic, inflammatory lung condition that cuts off airflow to the lungs. Many symptoms have been reported for such a disease: breathing problems, coughing, wheezing, and mucus production. Patients with COPD might be at risk, since they are more susceptible to heart disease and lung cancer. Methods: This study reviews COPD diagnosis utilizing various machine learning (ML) classifiers, such as Logistic Regression (LR), Gradient Boosting Classifier (GBC), Support Vector Machine (SVM), Gaussian Naïve Bayes (GNB), Random Forest Classifier (RFC), K-Nearest Neighbors Classifier (KNC), Decision Tree (DT), and Artificial Neural Network (ANN). These models were applied to a dataset comprising 1603 patients after being referred for a pulmonary function test. Results: The RFC has achieved superior accuracy, reaching up to 82.06% in training and 70.47% in testing. Furthermore, it achieved a maximum F score in training and testing with an ROC value of 0.0.82. Conclusions: The results obtained with the utilized ML models align with previous work in the field, with accuracies ranging from 67.81% to 82.06% in training and from 66.73% to 71.46% in testing.
-
Meta-heuristic optimization algorithms have become widely used due to their outstanding features, such as gradient-free mechanisms, high flexibility, and great potential for avoiding local optimal solutions. This research explored the grey wolf optimizer (GWO) to find the ideal configuration for a six-element Yagi–Uda antenna. The GWO algorithm adjusted the lengths of the antenna wires and the spacings between them. The goal was to maximize the antenna’s ability to transmit signals (throughput gain). Optimal antenna selection relies on various parameters, including gain, bandwidth, impedance matching, frequency, side-lobe levels, etc. The optimization of a six-element Yagi–Uda antenna presents a challenging engineering design problem due to its multimodal and nonlinear nature. Achieving optimal performance hinges on the intricate interplay between the lengths of the constituent elements and the spacing configurations. To this end, a multiobjective function was adopted to design this antenna. The performance of several meta-heuristic algorithms, including genetic algorithms, biogeography-based optimization, simulated annealing, and grey wolf optimizer, was compared. The GWO-based approach has performed better than its competitors. This optimized antenna design based on GWO reported a gain of 14.21 decibel. Therefore, the GWO-based method optimizes antennas that can be further investigated for other antenna design problems.
-
Across three online studies, we examined the relationship between the Fear of Missing Out (FoMO) and moral cognition and behavior. Study 1 (N = 283) examined whether FoMO influenced moral awareness, judgments, and recalled and predicted behavior of first-person moral violations in either higher or lower social settings. Study 2 (N = 821) examined these relationships in third-person judgments with varying agent identities in relation to the participant (agent = stranger, friend, or someone disliked). Study 3 (N = 604) examined the influence of recalling activities either engaged in or missed out on these relationships. Using the Rubin Causal Model, we created hypothetical randomized experiments from our real-world randomized experimental data with treatment conditions for lower or higher FoMO (median split), matched for relevant covariates, and compared differences in FoMO groups on moral awareness, judgments, and several other behavioral outcomes. Using a randomization-based approach, we examined these relationships with Fisher Tests and computed 95% Fisherian intervals for constant treatment effects consistent with the matched data and the hypothetical FoMO intervention. All three studies provide evidence that FoMO is robustly related to giving less severe judgments of moral violations. Moreover, those with higher FoMO were found to report a greater likelihood of committing moral violations in the past, knowing people who have committed moral violations in the past, being more likely to commit them in the future, and knowing people who are likely to commit moral violations in the future.
-
In this age of technology, building quality software is essential to competing in the business market. One of the major principles required for any quality and business software product for value fulfillment is reliability. Estimating software reliability early during the software development life cycle saves time and money as it prevents spending larger sums fixing a defective software product after deployment. The Software Reliability Growth Model (SRGM) can be used to predict the number of failures that may be encountered during the software testing process. In this paper we explore the advantages of the Grey Wolf Optimization (GWO) algorithm in estimating the SRGM’s parameters with the objective of minimizing the difference between the estimated and the actual number of failures of the software system. We evaluated three different software reliability growth models: the Exponential Model (EXPM), the Power Model (POWM) and the Delayed S-Shaped Model (DSSM). In addition, we used three different datasets to conduct an experimental study in order to show the effectiveness of our approach.
-
Barcode-less fruit recognition technology has revolutionized the checkout process by eliminating manual barcode scanning. This technology automatically identifies and adds fruit items to the purchase list, significantly reducing waiting times at the cash register. Faster checkouts enhance customer convenience and optimize operational efficiency for retailers. Adding barcode to fruits require using adhesives on the fruit surface that may cause health hazards. Leveraging deep learning techniques for barcode-less fruit recognition brings valuable advantages to industries, including advanced automation, enhanced accuracy, and increased efficiency. These benefits translate into improved productivity, cost reduction, and superior quality control. This study introduces a Convolutional Neural Network (CNN) designed explicitly for automatic fruit recognition, even in challenging real-world scenarios. The proposed method assists fruit sellers in accurately identifying and distinguishing between different types of fruit that may exhibit similarities. A dataset that includes 44,406 images of different fruit types is used to train and test our technique. Employing a CNN, the developed model achieves an impressive classification accuracy of 97.4% during the training phase and 88.6% during the testing phase respectively, showcasing its effectiveness in precise fruit recognition.
-
The advancement in treating medical data grows significantly daily. An accurate data classification model can help determine patient disease and diagnose disease severity in the medical domain, thus easing doctors' treatment burdens. Nonetheless, medical data analysis presents challenges due to uncertainty, the correlations between various measurements, and the high dimensionality of the data. These challenges burden statistical classification models. Machine Learning (ML) and data mining approaches have proven effective in recent years in gaining a deeper understanding of the importance of these aspects. This research adopts a well-known supervised learning classification model named a Decision Tree (DT). DT is a typical tree structure consisting of a central node, connected branches, and internal and terminal nodes. In each node, we have a decision to be made, such as in a rule-based system. This type of model helps researchers and physicians better diagnose a disease. To reduce the complexity of the proposed DT, we explored using the Feature Selection (FS) method to design a simpler diagnosis model with fewer factors. This concept will help reduce the data collection stage. A comparative analysis has been conducted between the developed DT and other various ML models, such as Logistic Regression (LR), Support Vector Machine (SVM), and Gaussian Naive Bayes (GNB), to demonstrate the effectiveness of the developed model. The results of the DT model establish a notable accuracy of 93.78% and an ROC value of 0.94, which beats other compared algorithms. The developed DT model provided promising results and can help diagnose heart disease. © 2024, Zarka Private University. All rights reserved.
-
Diabetes mellitus is a chronic disease affecting over 38.4 million adults worldwide. Unfortunately, 8.7 million were undiagnosed. Early detection and diagnosis of diabetes can save millions of people’s lives. Significant benefits can be achieved if we have the means and tools for the early diagnosis and treatment of diabetes since it can reduce the ratio of cardiovascular disease and mortality rate. It is urgently necessary to explore computational methods and machine learning for possible assistance in the diagnosis of diabetes to support physician decisions. This research utilizes machine learning to diagnose diabetes based on several selected features collected from patients. This research provides a complete process for data handling and pre-processing, feature selection, model development, and evaluation. Among the models tested, our results reveal that Random Forest performs best in accuracy (i.e., 0.945%). This emphasizes Random Forest’s efficiency in precisely helping diagnose and reduce the risk of diabetes.
-
The structure of blood vessels in the retina is a crucial factor in identifying and forecasting various eye diseases like cardiovascular diseases, diabetes, and other diseases. Therefore, detecting the structure of blood vessels from retinal fundus images is a critical field of research in healthcare. This study employed a novel deep learning model to segment vessels for different diseases, including Glaucoma, Diabetic Retinopathy (DR), and Age-related Macular Degeneration (AMD). We considered multiple transfer learning-based models and discovered that the ResNet-based U-Net architecture was the most effective for vessel segmentation, achieving the highest Dice Score above 84% for disease-agnostic, and 82%-84% for disease-specific conditions. We believe the proposed methodology will help to advance retinal vessel segmentation process and enhance the screening process of diseases based on retinal fundus images in clinical settings of Qatar Biobank as well as other biobanks across the globe. © 2023 IEEE.
Explore
Department
- Computer Science
- History (1)
- Physics (6)
- Psychology (2)
- Public Health (1)
Resource type
- Book (8)
- Book Section (9)
- Conference Paper (98)
- Journal Article (88)
- Report (13)
Publication year
- Between 1900 and 1999 (47)
-
Between 2000 and 2026
(169)
- Between 2000 and 2009 (28)
- Between 2010 and 2019 (52)
- Between 2020 and 2026 (89)