Skip to main content

Annals of Data Science OnlineFirst articles

30.05.2024

Analysis of the HIV/AIDS Data Using Joint Modeling of Longitudinal (k,l)-Inflated Count and Time to Event Data in Clinical Trials

Generalized linear mixed effect models (GLMEMs) are widely applied for the analysis of correlated non-Gaussian data such as those found in longitudinal studies. On the other hand, the Cox (proportional hazards, PHs) and the accelerated failure …

verfasst von:
Mojtaba Zeinali Najafabadi, Ehsan Bahrami Samani

27.05.2024

Omega —Type Probability Models: A Parametric Modification of Probability Distributions

A mathematical approach to developing new distributions is reviewed. The method which composes of integration and the concept of a normalizing constant, allows for primitive interjection of new parameter(s) in an existing distribution to form new …

verfasst von:
Udochukwu Victor Echebiri, Nosakhare Liberty Osawe, Chukwuemeka Thomas Onyia

25.05.2024

UAV-YOLOv5: A Swin-Transformer-Enabled Small Object Detection Model for Long-Range UAV Images

This paper tackle the challenges associated with low recognition accuracy and the detection of occlusions when identifying long-range and diminutive targets (such as UAVs). We introduce a sophisticated detection framework named UAV-YOLOv5, which …

verfasst von:
Jun Li, Chong Xie, Sizheng Wu, Yawei Ren

23.05.2024

A Survey of Artificial Intelligence for Industrial Detection

In the past decade, deep learning has greatly increased the complexity of industrial production intelligence by virtue of its powerful learning capability. At the same time, it has also brought security challenges to the field of industrial …

verfasst von:
Jun Li, YiFei Hai, SongJia Yin

23.05.2024

A Deep Convolutional Neural Network-Based Approach for Visual Search & Recommendation of Grocery Products

Search and recommendation are two essential features of any e-commerce website for finding and purchasing a specific product. Visual Search is a promising and quick method in comparison to a textual-based search method. Hence, the objective of …

verfasst von:
Nawreen Anan Khandaker, Amrin Rahman, Amrin Akter Pinky, Tasmiah Tamzid Anannya

21.05.2024

Combining Nonlinear Features of EEG and MRI to Diagnose Alzheimer’s Disease

This article, a new method for the diagnosis of Alzheimer’s disease in the mild stage is presented according to combining the characteristics of EEG signal and MRI images. The brain signal is recorded in four modes of closed-eyes, open eye …

verfasst von:
Elias Mazrooei Rad, Mahdi Azarnoosh, Majid Ghoshuni, Mohammad Mahdi Khalilzadeh

20.05.2024

Spatial Data Analysis for Robust Classification of Network Topology Through Synthetic Combinatorics

The measurement of network topology through various spatial topological indices like Alpha, Beta and Gamma are widely used for spatial data analysis. However, explaining the classification of the network topology of a city based on Alpha, Beta and …

verfasst von:
Samrat Hore, Stabak Roy, Malabika Boruah, Saptarshi Mitra

20.05.2024

Evaluating the Performance of Machine Learning Algorithm for Classification of Safer Sexual Negotiation among Married Women in Bangladesh

Safer sexual practice is essential for improving women’s reproductive and sexual health outcomes. The goal of this study is to identify the contributing factors influencing safer sexual negotiations (SSN) through the application of machine …

verfasst von:
Md. Mizanur Rahman, Deluar J. Moloy, Mashfiqul Huq Chowdhury, Arzo Ahmed, Taksina Kabir

11.05.2024

Unified Image Harmonization with Region Augmented Attention Normalization

The image harmonization task endeavors to adjust foreground information within an image synthesis process to achieve visual consistency by leveraging background information. In academic research, this task conventionally involves the utilization …

verfasst von:
Junjie Hou, Yuqi Zhang, Duo Su

18.04.2024

Half Logistic Generalized Rayleigh Distribution for Modeling Hydrological Data

This article introduced a three-parameter extension of the Generalized Rayleigh distribution called half-logistic Generalized Rayleigh distribution, which has submodels the Generalized Rayleigh and Rayleigh distribution. The proposed model is …

verfasst von:
Adebisi A. Ogunde, Subhankar Dutta, Ehab M. Almetawally

17.04.2024

An Improved Boosting Bald Eagle Search Algorithm with Improved African Vultures Optimization Algorithm for Data Clustering

Data clustering is one of the main issues in the optimization problem. It is the process of clustering a group of items into several groups. Items within each group have the greatest similarity and the least similarity to things in other groups.

verfasst von:
Farhad Soleimanian Gharehchopogh

17.04.2024

One-Inflated Zero-Truncated Poisson Distribution: Statistical Properties and Real Life Applications

Agriculture, engineering, public health, sociology, psychology, and epidemiology are just few of the numerous disciplines that find analysis and modeling of zero-truncated count data to be of paramount importance. Very recently, researchers have …

verfasst von:
Mohammad Kafeel Wani, Peer Bilal Ahmad

30.03.2024

Optimal Strategy for Elevated Estimation of Population Mean in Stratified Random Sampling under Linear Cost Function

In this paper, we propose the exponential ratio-type estimator for the elevated estimation of population mean, implying one auxiliary variable in stratified random sampling using the conventional ratio and, Bahl and Tuteja exponential ratio-type …

verfasst von:
Subhash Kumar Yadav, Mukesh Kumar Verma, Rahul Varshney

20.03.2024

Optimal Key Generation for Privacy Preservation in Big Data Applications Based on the Marine Predator Whale Optimization Algorithm

In the era of big data, preserving data privacy has become paramount due to the sheer volume and sensitivity of the information being processed. This research is dedicated to safeguarding data privacy through a novel data sanitization approach …

verfasst von:
Poonam Samir Jadhav, Gautam M. Borkar

19.03.2024

Semiparametric Regression Analysis of Panel Count Data with Multiple Modes of Recurrence

Panel count data refers to the information collected in studies focusing on recurrent events, where subjects are observed only at specific time points. If these study subjects are exposed to recurrent events of several types, we obtain panel count …

verfasst von:
Mathew P. M. Ashlin, P. G. Sankaran, E. P. Sreedevi

08.03.2024

Applying BERT-Based NLP for Automated Resume Screening and Candidate Ranking

In this research, we introduce an innovative automated resume screening approach that leverages advanced Natural Language Processing (NLP) technology, specifically the Bidirectional Encoder Representations from Transformers (BERT) language model …

verfasst von:
Asmita Deshmukh, Anjali Raut

27.02.2024

Bayesian Inference for the Entropy of the Rayleigh Model Based on Ordered Ranked Set Sampling

Recently, ranked set samples schemes have become quite popular in reliability analysis and life-testing problems. Based on ordered ranked set sample, the Bayesian estimators and credible intervals for the entropy of the Rayleigh model are studied …

verfasst von:
Mohammed S. Kotb, Haidy A. Newer, Marwa M. Mohie El-Din

27.02.2024

A Joint Cognitive Latent Variable Model for Binary Decision-making Tasks and Reaction Time Outcomes

Traditionally, in cognitive modeling for binary decision-making tasks, stochastic differential equations, particularly a family of diffusion decision models, are applied. These models suffer from difficulties in parameter estimation and …

verfasst von:
Mahdi Mollakazemiha, Ehsan Bahrami Samani

15.02.2024

A New Hyperbolic Tangent Family of Distributions: Properties and Applications

This paper introduces a new family of distributions called the hyperbolic tangent (HT) family. The cumulative distribution function of this model is defined using the standard hyperbolic tangent function. The fundamental properties of the …

verfasst von:
Shahid Mohammad, Isabel Mendoza

Open Access 14.02.2024

Assessing the Risk of Bitcoin Futures Market: New Evidence

The main objective of this paper is to forecast the realized volatility (RV) of Bitcoin futures (BTCF) market. To serve our purpose, we propose an augmented heterogenous autoregressive (HAR) model to consider the information on time-varying jumps …

verfasst von:
Anupam Dutta