Data mining is a huge industry that intrudes on people's lives by gathering information about individuals legally, but without their knowledge or consent.

With recent technical advances in the processing power, storage capacity, and interconnectivity of computer technology, data mining is seen by modern business as an increasingly important tool for transforming unprecedented quantities of digital data into business intelligence that gives an informational advantage.

It is currently used in a wide range of profiling practices, such as marketing, surveillance, fraud detection and scientific discovery. The growing consensus that data mining can bring real value has led to an explosion in demand for novel data mining technologies.

The knowledge discovery process begins with data cleaning, which removes noise and inconsistent data.

Data integration is the step in which multiple data sources may be combined. In data selection, the data relevant to the analysis task are retrieved. In data transformation, the data are transformed or consolidated into forms appropriate for mining by performing summary and aggregation operations.
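The preparation steps described so far (cleaning, integration, selection and transformation) can be sketched in a few lines of Python. This is only an illustration: the records, field names and helper functions below are hypothetical, and real projects usually rely on a database or a data-frame library instead.

```python
from collections import defaultdict

# Two hypothetical data sources (illustrative records, not real data).
store_sales = [
    {"customer": "c1", "item": "milk", "amount": 2.5},
    {"customer": "c2", "item": "bread", "amount": None},  # missing value
    {"customer": "c1", "item": "bread", "amount": 1.5},
]
web_sales = [
    {"customer": "c2", "item": "milk", "amount": 3.0},
    {"customer": "c3", "item": "tv", "amount": 400.0},
]


def clean(records):
    """Cleaning: drop records whose amount is missing or negative."""
    return [r for r in records if r["amount"] is not None and r["amount"] >= 0]


def integrate(*sources):
    """Integration: combine several sources into one list of records."""
    combined = []
    for source in sources:
        combined.extend(source)
    return combined


def select(records, items):
    """Selection: keep only the records relevant to the analysis task."""
    return [r for r in records if r["item"] in items]


def aggregate(records):
    """Transformation: summarize the data as total spend per customer."""
    totals = defaultdict(float)
    for r in records:
        totals[r["customer"]] += r["amount"]
    return dict(totals)


combined = integrate(clean(store_sales), clean(web_sales))
groceries = select(combined, {"milk", "bread"})
spend_per_customer = aggregate(groceries)
print(spend_per_customer)
```

Each function corresponds to one step, so the order of the steps is visible in the final three lines.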

Data mining is the essential process in which intelligent methods are applied in order to extract data patterns. Pattern evaluation identifies the truly interesting patterns representing knowledge, based on some interestingness measures.

Knowledge presentation is the step in which visualization and knowledge representation techniques are used to present the mined knowledge to the user.

Knowledge discovery in databases (KDD) is of interest to researchers in machine learning, pattern recognition, databases, statistics, artificial intelligence, knowledge acquisition for expert systems, and data visualization.

The unifying goal of the KDD process is to extract knowledge from data in the context of large databases. It does this by using data mining methods (algorithms) to extract (identify) what is deemed knowledge, according to the specifications of measures and thresholds, using a database along with any required preprocessing, subsampling, and transformations of that database.

[Figure: Data mining model]

Techniques of data mining

Several major data mining techniques have been developed and used in data mining projects recently, including association, classification, clustering, regression, prediction and sequential patterns.

Association

Association rule learning searches for relationships between variables. For example, a supermarket might gather data on customer purchasing habits. Using association rule learning, the supermarket can determine which products are frequently bought together and use this information for marketing purposes.
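As a rough illustration of the supermarket example, the sketch below counts how often pairs of products appear in the same basket (support) and how often one product is bought given another (confidence). The baskets and function names are invented for the example. Production systems typically use a dedicated algorithm such as Apriori, not this brute-force count.

```python
from collections import Counter
from itertools import combinations

# Hypothetical shopping baskets (illustrative data only).
baskets = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "eggs"},
]


def pair_support(baskets):
    """Return the fraction of baskets that contain each pair of products."""
    counts = Counter()
    for basket in baskets:
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    return {pair: n / len(baskets) for pair, n in counts.items()}


def confidence(baskets, antecedent, consequent):
    """Of the baskets containing `antecedent`, the fraction that also contain `consequent`."""
    with_antecedent = [b for b in baskets if antecedent in b]
    if not with_antecedent:
        return 0.0
    return sum(consequent in b for b in with_antecedent) / len(with_antecedent)


support = pair_support(baskets)
print(support[("bread", "milk")])                # support of {bread, milk}
print(confidence(baskets, "butter", "bread"))    # confidence of butter -> bread
```

A pair with high support and high confidence is a candidate for joint promotion or shelf placement.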

Association rule learning applied to shopping data is sometimes called market basket analysis.

Classification

Classification is the task of generalizing known structure to apply to new data. For example, an email program might attempt to classify an email as legitimate or spam. Common algorithms include decision tree learning, nearest neighbor, neural networks and support vector machines.
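The email example can be sketched with the nearest neighbor approach, one of the algorithms listed above. The two features (number of links and number of exclamation marks) and the labelled examples are assumptions made up for illustration. A real spam filter would use far richer features.

```python
import math
from collections import Counter

# Hypothetical labelled emails: (links, exclamation marks) -> label.
training = [
    ((0, 0), "legitimate"),
    ((1, 0), "legitimate"),
    ((1, 1), "legitimate"),
    ((6, 4), "spam"),
    ((8, 5), "spam"),
    ((7, 7), "spam"),
]


def classify(features, training, k=3):
    """Label a new email by majority vote among its k nearest training examples."""
    neighbors = sorted(training, key=lambda example: math.dist(features, example[0]))[:k]
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]


print(classify((7, 5), training))
print(classify((0, 1), training))
```

The known structure here is the set of labelled examples, and generalizing means assigning a new email the label of the examples it most resembles.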

Clustering

Clustering is the assignment of a set of observations into subsets (clusters) so that observations in the same cluster are similar in some sense. Clustering is a method of unsupervised learning and a common technique for statistical data analysis used in many fields, including machine learning, data mining, pattern recognition, image analysis, information retrieval and bioinformatics.
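One common way to form such clusters is k-means, shown here as a minimal sketch. The essay does not name a specific clustering algorithm, so k-means is an assumption chosen for illustration, as are the sample points and the starting centroids.

```python
import math


def k_means(points, centroids, iterations=10):
    """Group points around centroids by repeatedly reassigning and re-averaging."""
    clusters = [[] for _ in centroids]
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: math.dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        new_centroids = []
        for cluster, old in zip(clusters, centroids):
            if cluster:
                new_centroids.append(tuple(sum(axis) / len(cluster) for axis in zip(*cluster)))
            else:
                new_centroids.append(old)  # keep an empty cluster's centroid in place
        if new_centroids == centroids:
            break  # assignments have stabilized
        centroids = new_centroids
    return centroids, clusters


# Hypothetical two-dimensional observations.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 0.5), (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
centroids, clusters = k_means(points, [(0.0, 0.0), (10.0, 10.0)])
print(clusters)
```

No labels are supplied, which is what makes the method unsupervised: similarity is defined only by distance between observations.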

Regression

Regression is a data mining function that predicts a number. Profit, sales, house value, square footage, temperature, or distance could all be predicted using regression techniques.

For example, a regression model could be used to predict the value of a house based on location, number of rooms, lot size, and other factors. Regression models are tested by computing various statistics that measure the difference between the predicted values and the expected values.

The historical data for a regression project is typically divided into two data sets, one for building the model and one for testing it. A regression models the past relationship between variables to predict their future behavior.
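The house-value example can be sketched with a least-squares line fitted to one predictor. The figures below (number of rooms and price in thousands) are invented, and the choice of mean absolute error as the test statistic is an assumption; other error measures are equally common.

```python
def fit_line(xs, ys):
    """Ordinary least squares for one predictor: returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def mean_absolute_error(slope, intercept, xs, ys):
    """Average gap between predicted and expected values."""
    return sum(abs((slope * x + intercept) - y) for x, y in zip(xs, ys)) / len(xs)


# Hypothetical historical data: (rooms, price in thousands).
history = [(2, 110), (3, 150), (4, 190), (5, 230), (6, 275), (3, 145)]
build, test = history[:4], history[4:]  # one set to build the model, one to test it

slope, intercept = fit_line([x for x, _ in build], [y for _, y in build])
error = mean_absolute_error(slope, intercept, [x for x, _ in test], [y for _, y in test])
print(slope, intercept, error)
```

Keeping the test records out of the fitting step is what makes the error a fair estimate of how the model behaves on unseen houses.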

When one independent variable is used in a regression, it is called a simple regression; when two or more independent variables are used, it is called a multiple regression.

Sequential Pattern

Sequence analysis is concerned with a subsequent purchase of a product or products given a previous buy.

For instance, buying an extended warranty is more likely to follow the purchase of a TV or other electrical appliance. There is a wide range of applications for sequence analysis in many areas of industry, including customer shopping patterns, phone call patterns and web log streams.
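The warranty example can be illustrated by measuring how often one purchase is followed by another in time-ordered customer histories. The histories and the `follow_up_rate` helper are hypothetical. Full sequential pattern mining algorithms search for all frequent sequences and do not stop at a single chosen pair.

```python
# Hypothetical purchase histories, each listed in chronological order.
histories = {
    "c1": ["tv", "extended warranty", "hdmi cable"],
    "c2": ["tv", "hdmi cable"],
    "c3": ["laptop", "extended warranty"],
    "c4": ["tv", "extended warranty"],
}


def follow_up_rate(histories, first, later):
    """Fraction of customers who bought `first` and then bought `later` afterwards."""
    bought_first = 0
    followed = 0
    for purchases in histories.values():
        if first in purchases:
            bought_first += 1
            position = purchases.index(first)
            if later in purchases[position + 1:]:
                followed += 1
    return followed / bought_first if bought_first else 0.0


print(follow_up_rate(histories, "tv", "extended warranty"))
print(follow_up_rate(histories, "laptop", "extended warranty"))
```

Unlike the market basket case, the order of purchases matters here: a warranty bought before the TV would not count.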

Management Information System

A management information system (MIS) is a system that provides the information needed to manage organizations effectively. Management information systems are regarded as a subset of the overall internal control procedures in a business, which cover the application of people, documents, technologies and procedures used by management accountants to solve business problems such as costing a product, a service or a business-wide strategy.

