This study defines a new approach for building a Web Services-based infrastructure for distributed data mining applications. The proposed architecture provides a roadmap for "autonomic" functionality of the infrastructure hiding the...
Missing data is very common in survey research. However, few guidelines currently exist for diagnosing and remedying missing data in survey research. The goal of this thesis was to investigate properties and effects of three selected...
Brief Overview of the Problem: The Environmental Protection Agency (EPA), a government-funded agency, provides both legislative and judicial powers for emissions monitoring in the United States. The agency crafts laws based on self-made regulations...
Data libraries--Security measures; Computer networks--Security measures; Information storage and retrieval systems; Digital preservation
Data centers (DC) are the core of the national cyber infrastructure. With the incredible growth of critical data volumes in financial institutions, government organizations, and global companies, data centers are becoming larger and more...
Swarm Intelligence (SI) techniques were inspired by bee swarms, ant colonies, and most recently, bird flocks. Flock-based Swarm Intelligence (FSI) has several unique features, namely decentralized control, collaborative learning, high exploration...
Educational leadership; School improvement programs; School management and organization
This study examines how two schools utilized elements of distributed leadership to implement strategies from a reform intervention for whole school and classroom improvement planning from data. The notion of distributed leadership was refined in a...
Clustered longitudinal data are often collected as repeated measurements over time on subjects nested within clusters. Examples include longitudinal community intervention studies and family studies with repeated measures on each member. Meanwhile,...
In this dissertation research, we aim to solve problems for two types of survival data: clustered survival data with potentially informative cluster size, and sojourn time data. The methods for these two types of data are different. However, both...
DNA microarrays--Statistical methods; Gene expression--Statistical methods
Data derived from gene expression microarrays are frequently used to identify candidate genes which can characterize and distinguish between two biological phenotypes. A key step in this process is the selection of an appropriate test statistic to...
Data mining; Lungs--Cancer; Outcome assessment (Medical care)
Lung cancer is the leading cause of cancer death in the United States and the world, with more than 1.3 million deaths worldwide per year. However, because of a lack of effective tools to diagnose lung cancer, more than half of all cases are...
Many important applications require the discovery of items which have occurred frequently. Knowledge of these items is commonly used in anomaly detection and network monitoring tasks. Effective solutions for this problem focus mainly on reducing...
Bioinformatics; Breast--Cancer--Treatment; Medical care--Data processing
Statistical models have been the first choice for comparative effectiveness in clinical research. Though effective, these models are limited when the data to be analyzed do not fit the assumed distributions, which is mostly the case when the study...
The purpose of this dissertation is to find ways to decrease Medicare costs, to study health outcomes of diabetes patients, and to investigate the influence of Medicare Part D since its introduction in 2006, using the CMS CCW (Chronic...
Data classification is a task found in many everyday activities. In general, the term can be used for any activity that derives some decision or forecast from the currently available information. Using a more accurate definition, a...
This project will provide a service-oriented architecture to handle sensor data in real time as it arrives. We are incorporating two types of sensors into our project: mobile sensors and stationary sensors. These sensors...
Human beings acquire information more easily by being shown an object than by reading a description of it. Our brain stores the images that the eyes see, and through brain mapping, people can analyze information by imagining it. This is the...
Medical records--Data processing; Data mining; Medical care--Data processing; Coronary heart disease--Treatment
The goal of this study is to use a data mining framework to assess the three main treatments for acute myocardial infarction: thrombolytic therapy, percutaneous coronary intervention (percutaneous angioplasty), and coronary artery bypass surgery....
An ensemble consists of a set of individual predictors whose predictions are combined. Generally, different classification and regression models tend to work well for different types of data, and it is usually not known which algorithm will be...
Longitudinal studies occupy an important role in scientific research and clinical trials. When analyzing longitudinal data, investigators are often confronted with missing data, which can produce biases, even in...
Heart--Diseases--Patients--Rehabilitation; Data mining
The purpose of this paper is to examine the process of text mining and to use the results to show the possible benefits of cardiopulmonary rehabilitation. The 555 patients enrolled in the study were receiving inpatient cardiopulmonary...