Enhancing Fuzzy Associative Rule Mining Approaches for Improving Prediction Accuracy. Integration of Fuzzy Clustering, Apriori and Multiple Support Approaches to Develop an Associative Classification Rule Base
Author: Sowan, Bilal I.
Supervisor: Dahal, Keshav P.
Hossain, M. Alamgir
Fuzzy associative rule mining
Associative classification rule base
Decision support system
Minimizing prediction error
The University of Bradford theses are licenced under a Creative Commons Licence.
Institution: University of Bradford
Department: School of Computing, Informatics & Media
Abstract: Building an accurate and reliable prediction model for different application domains is one of the most significant challenges in knowledge discovery and data mining. This thesis focuses on building and enhancing a generic predictive model for estimating a future value by extracting association rules (knowledge) from a quantitative database. The model is applied to several data sets obtained from different benchmark problems, and the results are evaluated through extensive experimental tests. The thesis presents an incremental, three-stage development process for the prediction model. Firstly, a Knowledge Discovery (KD) model is proposed that integrates Fuzzy C-Means (FCM) with the Apriori approach to extract Fuzzy Association Rules (FARs) from a database and build a Knowledge Base (KB) for predicting a future value. The KD model has been tested with two road-traffic data sets. Secondly, the initial model has been further developed by including a diversification method that improves the reliability of the FARs by identifying the best and most representative rules. The resulting Diverse Fuzzy Rule Base (DFRB) maintains high-quality and diverse FARs, offering a more reliable and generic model. The model uses FCM to transform quantitative data into fuzzy data, while a Multiple Support Apriori (MSapriori) algorithm is adapted to extract the FARs from the fuzzy data. The correlation values for these FARs are calculated, and an efficient orientation for filtering FARs is performed as a post-processing method. The diversity of the FARs is maintained by clustering them, based on the concept of the sharing function technique used in multi-objective optimization. The best and most diverse FARs are obtained as the DFRB, which is used within the Fuzzy Inference System (FIS) for prediction. The third stage of development proposes a hybrid prediction model called the Fuzzy Associative Classification Rule Mining (FACRM) model.
This model integrates the improved Gustafson-Kessel (G-K) algorithm, the proposed Fuzzy Associative Classification Rules (FACR) algorithm and the proposed diversification method. The improved G-K algorithm transforms quantitative data into fuzzy data, while the FACR algorithm generates significant rules (Fuzzy Classification Association Rules (FCARs)) by employing the improved multiple support threshold, associative classification and vertical scanning format approaches. These FCARs are then filtered by calculating the correlation value and the distance between them. The advantage of the proposed FACRM model is that it builds a generalized prediction model able to deal with different application domains. The validation of the FACRM model is conducted using different benchmark data sets from the University of California, Irvine (UCI) Machine Learning Repository and the KEEL (Knowledge Extraction based on Evolutionary Learning) repository, and the results of the proposed FACRM model are also compared with other existing prediction models. The experimental results show that the error rate and generalization performance of the proposed model are better than those of commonly used models on the majority of data sets. A new method for feature selection, entitled Weighting Feature Selection (WFS), is also proposed. The WFS method aims to improve the performance of the FACRM model. The prediction performance is improved by minimizing the prediction error and reducing the number of generated rules. The prediction results of FACRM with WFS have been compared with those of FACRM and Stepwise Regression (SR) models for different data sets. The performance analysis and comparative study show that the proposed prediction model provides an effective approach that can be used within a decision support system.
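As a rough illustration of the pipeline the abstract describes (fuzzify quantitative values, then score fuzzy association rules against per-item support thresholds), the following minimal sketch may help. It is not the thesis's implementation: the cluster centres are fixed by hand rather than learned, the minimum t-norm is assumed for combining memberships, and all names (`fcm_memberships`, `fuzzy_support`, `passes_multiple_support`, the toy speed/delay data and the `mis` thresholds) are hypothetical.

```python
def fcm_memberships(x, centers, m=2.0):
    """Standard FCM membership of scalar x in each cluster, for fixed, distinct centers."""
    dists = [abs(x - c) for c in centers]
    # A point that coincides with a center belongs fully to that cluster.
    if 0.0 in dists:
        return [1.0 if d == 0.0 else 0.0 for d in dists]
    power = 2.0 / (m - 1.0)
    return [1.0 / sum((d_i / d_j) ** power for d_j in dists) for d_i in dists]


def fuzzify(value, attribute, centers):
    """Turn one quantitative value into fuzzy items such as 'speed=low'."""
    labels = list(centers)
    degrees = fcm_memberships(value, [centers[label] for label in labels])
    return {f"{attribute}={label}": u for label, u in zip(labels, degrees)}


def fuzzy_support(records, itemset):
    """Mean over records of the minimum membership among the itemset's items (assumed t-norm)."""
    return sum(min(r.get(i, 0.0) for i in itemset) for r in records) / len(records)


def fuzzy_confidence(records, antecedent, consequent):
    """Support of the whole rule divided by support of its antecedent."""
    s_a = fuzzy_support(records, antecedent)
    return fuzzy_support(records, antecedent | consequent) / s_a if s_a > 0 else 0.0


def passes_multiple_support(records, itemset, mis):
    """MSapriori-style check: the itemset's threshold is the lowest
    minimum item support (MIS) among its items."""
    return fuzzy_support(records, itemset) >= min(mis[i] for i in itemset)


# Hypothetical toy data: (speed, delay) pairs and hand-picked cluster centers.
speed_centers = {"low": 10.0, "high": 30.0}
delay_centers = {"low": 1.0, "high": 9.0}
raw = [(15.0, 8.0), (25.0, 2.0)]
records = [
    {**fuzzify(s, "speed", speed_centers), **fuzzify(d, "delay", delay_centers)}
    for s, d in raw
]

antecedent = frozenset({"speed=low"})
consequent = frozenset({"delay=high"})
mis = {"speed=low": 0.3, "delay=high": 0.4}  # assumed per-item thresholds

print(fuzzy_support(records, antecedent | consequent))
print(fuzzy_confidence(records, antecedent, consequent))
print(passes_multiple_support(records, antecedent | consequent, mis))
```

In a fuller pipeline, rules that pass these checks would then go through the correlation-based filtering and diversification steps the abstract mentions; those steps are not sketched here because the abstract does not specify them.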
Showing items related by title, author, creator and subject.
Prediction of natural frequencies of turbine blades for turbocharger application. An investigation of the finite element method, mathematical modelling and frequency survey methods applied to turbocharger blade vibration in order to predict natural frequencies of turbocharger blades.
Olley, Peter; Zdunek, Agnieszka Izabela (University of Bradford, School of Engineering and Informatics, 2015-07-03)
Methods of determining natural frequencies of the D76D88, B76D88, A86E93, C86G90, C86L90 and C125L89 turbine wheel designs for various environmental conditions were investigated by application of Finite Element Analysis and beam theory. Modelling and simulation methods were developed: the first method composed of 15 finite element simulations; the second composed of 15 finite element simulations and a set of experimental frequency survey results; the third composed of 5 simulations, an incorporated mathematical model and a set of experimental frequency survey results. Each of these methods was designed to allow prediction of resonant frequency changes across a range of exhaust gas temperature and shaft rotational speed. For the new modelling and simulation methods, an analysis template and a plotting tool were developed using Microsoft Excel and MATLAB software. A graph showing frequency-temperature-speed variations and a Campbell Diagram that incorporates material stiffening and softening effects across a range of rotational speeds were designed, and applied to the D76D88, B76D88, A86E93, C86G90, C86L90 and C125L89 turbine wheel designs. New design methodologies for turbine wheels were formulated and validated, showing a good agreement with a range of data points from frequency survey, strain-gauge telemetry and laser tip-timing test results. The results from the new design method were compared with the existing single compensation factor methodology, and showed a great improvement in accuracy of prediction of modal vibration.
A new nomenclature for the mode shapes of a turbocharger’s blade was proposed, designed and demonstrated to allow direct identification of associated mode shape. It is concluded that Finite Element Analysis combined with the frequency survey is capable of predicting changes in turbine natural frequencies and, when incorporated into the existing turbine design methodology, resulted in a major improvement in the accuracy of the predictions of vibration frequency.
Automated Solar Activity Prediction: A hybrid computer platform using machine learning and solar imaging for automated prediction of solar flares
Colak, Tufan; Qahwaji, Rami S.R. (AGU, 04/06/2009)
The importance of real-time processing of solar data especially for space weather applications is increasing continuously. In this paper, we present an automated hybrid computer platform for the short-term prediction of significant solar flares using SOHO/Michelson Doppler Imager images. This platform is called the Automated Solar Activity Prediction tool (ASAP). This system integrates image processing and machine learning to deliver these predictions. A machine learning-based system is designed to analyze years of sunspot and flare data to create associations that can be represented using computer-based learning rules. An imaging-based real-time system that provides automated detection, grouping, and then classification of recent sunspots based on the McIntosh classification is also created and integrated within this system. The properties of the sunspot regions are extracted automatically by the imaging system and processed using the machine learning rules to generate the real-time predictions. Several performance measurement criteria are used and the results are provided in this paper. Also, quadratic score is used to compare the prediction results of ASAP with NOAA Space Weather Prediction Center (SWPC) between 1999 and 2002, and it is shown that ASAP generates more accurate predictions compared to SWPC.
Predicting spontaneous racemate resolution using recent developments in crystal structure prediction
Kendrick, John; Gourlay, Matthew D.; Neumann, M.A.; Leusen, Frank J.J. (2009)
A hybrid molecular mechanics and quantum mechanics solid state DFT method is used to re-rank the stability of racemic and enantiopure crystal structures of four molecules; 4-hydroxymethyl-2-oxazolidinone, 5-hydroxymethyl-2-oxazolidinone, 2-(4-hydroxyphenyl)-2,5,5-trimethylpyrrolidine-1-oxy and 2-(3-hydroxyphenyl)-2,5,5-trimethylpyrrolidine-1-oxy. Previous work using a force field based method to predict these crystal structures indicated that the lattice energy may be a suitable criterion for predicting whether a chiral molecule will resolve spontaneously on crystallisation. However, in some cases, the method had predicted an unrealistically high lattice energy for the structure corresponding to the experimentally observed one. The Hybrid DFT method successfully predicts those molecules which resolve spontaneously and furthermore predicts satisfactory lattice energies for all experimentally observed structures. Based on a comparison of the predicted lattice energies from the two methods it is concluded that the force fields used were not sufficiently accurate to predict spontaneous resolution with any confidence. However, the Hybrid DFT method is shown to be sufficiently accurate for making such predictions.