The topic of this dissertation is the automation of the process of extracting understandable patterns and rules from data. An unprecedented amount of data is available to anyone with a computer connected to the Internet. The disciplines of Data...
Missing data is very common in survey research. However, currently few guidelines exist with regard to the diagnosis and remedy to missing data in survey research. The goal of this thesis was to investigate properties and effects of three selected...
Archaeologists use soil analysis to detect chemicals, like phosphate, to indicate areas of anthropogenic activity. Phosphate detection is a multi-step process, which makes standard techniques time consuming. Kinetic studies decreased the analysis...
The area of Knowledge discovery and data mining is growing rapidly. Feature Discretization is a crucial issue in Knowledge Discovery in Databases (KDD), or Data Mining because most data sets used in real world applications have features with...
Three-dimensional imaging in medicine; Lungs--Cancer--Diagnosis
Many lung diseases or injuries can cause biomechanical or material property changes that can alter
lung function. While the mechanical changes associated with the change of the material properties
originate at a regional level, they remain largely...
DNA microarrays--Statistical methods; Gene expression--Statistical methods
Data derived from gene expression microarrays are frequently used to identify candidate genes which can characterize and distinguish between two biological phenotypes. A key step in this process is the selection of an appropriate test statistic to...
United Parcel Service; Aeronautics, Commercial--Freight--Data processing; Forecasting--Data processing
This thesis develops a forecasting model to predict six different volume measures on a weekly and daily basis for UPS-Supply Chain Solutions (UPS-SCS). The volume measures are used by UPS-SCS to develop business plans, operation plans, and staffing...
The focus of this dissertation work is the formulation and improvement of anemia management process involving trial-and-error. A two-stage method is adopted toward this objective. Given a medical treatment process, a discrete Markov representation...
Bioinformatics; Breast--Cancer--Treatment; Medical care--Data processing
Statistical models have been the first choice for comparative effectiveness in clinical research. Though effective, these models are limited when the data to be analyzed do not fit the assumed distributions; which is mostly the case when the study...
The advent of larger storage spaces, affordable digital capturing devices, and an ever growing online community dedicated to sharing images has created a great need for efficient analysis methods. In fact, analyzing images for the purpose of...
Youth--Employment; Teenagers--Health and hygiene; Industrial safety; Safety education, Industrial
Teenaged workers are twice as likely to be injured on the job as adult workers, and face a
number of differences developmentally and psychosocially that present challenges for
their safety at work. Little research has focused on the tasks that...
Tick-borne diseases; Health behavior; Health attitudes; Health education--Social aspects
Human monocytic ehrlichiosis (HME), a tick-borne disease that has recently surfaced in
the United States, exists in regions where the tick vector population is established. This
study utilizes methods that look beyond identifying high-risk regions,...
Writing centers; English language--Rhetoric--Study and teaching (Higher); Oral reading
Reading aloud in writing center sessions is a common practice, one that is both under-studied and under-theorized. In an attempt to begin to address this gap, this dissertation conducts an empirical analysis of three different methods of reading...
New products--Management; Consumer satisfaction; Marketing--Management
A constant flow of innovative products which meets the needs of customers and therefore is a monetary success for the inventing organization is important for the long term success of organizations, especially in modem dynamic markets. As resources...
A common research interest in medical, biological, and engineering research is determining whether certain independent variables are correlated with the survival or failure times. Standard statistical techniques cannot usually be applied for...
Pattern perception--Data processing; Pattern recognition systems; Land mines--Detection; Data mining
For complex detection and classification problems, involving data with large intra-class variations and noisy inputs, no single source of information can provide a satisfactory solution. As a result, combination of multiple classifiers is playing...
Pattern recognition systems; Land mines--Detection; Pattern perception--Data processing; Data mining
Traditional machine learning and pattern recognition systems use a feature descriptor to describe the sensor data and a particular classifier (also called "expert" or "learner") to determine the true class of a given pattern....
Few previous studies have compared microbial communities in subterranean and surface environments. Chemical analyses used to characterize the surface and cave microbial environments indicated limited exchange between surface and subsurface waters....
Image processing; Computer vision; Pattern recognition systems
Image segmentation is one of the most important problems in image processing, object recognition, computer vision, medical imaging, etc. In general, the objective of the segmentation is to partition the image into the meaningful areas using the...
Representing the complex data in a concise and accurate way is a special stage in data mining methodology. Redundant and noisy data affects generalization power of any classification algorithm, undermines the results of any clustering algorithm and...