Title: Diapers and beer reflect the era of massive data processing
Author: wangzheng3056 Time: 2013-8-24 10:45

As information technology continues to evolve, people's capacity to process data with it has grown greatly, and more and more databases are used in business management, production control, engineering design, and other areas. However, in the face of ever-increasing and increasingly complex data, the query functions that already exist in databases can no longer meet people's needs, and whether the information and knowledge people need can be extracted from the data is a growing concern. Traditional statistical techniques face tremendous challenges, and data mining, which combines statistics, databases, and knowledge discovery, has emerged in response. In recent years, data mining has been widely used in retail, direct marketing, manufacturing, finance, insurance, communications, and medical services.

1. Basic concepts of data mining

Author: wangzheng3056 Time: 2013-8-24 10:46
(1) The classic "beer and diapers" case

To understand the concept of data mining, let us first look at the "beer and diapers" story. The protagonist of the story, Wal-Mart, is the world's largest retailer. In its thousands of supermarkets across the United States, baby diapers and beer are placed side by side on nearby shelves, and both sell well. Wal-Mart built a data warehouse from its original transaction data, compiled statistics according to product sales cycles, and then analyzed the data with data mining tools. It found that sales of both beer and diapers in its supermarkets were very high on weekends. Further investigation showed that in American families with young children, wives often asked their husbands to buy diapers for the children after work, and the husbands, after buying the diapers, picked up beer for themselves along the way, so beer and diapers were frequently bought together. The stores then broke with convention and placed beer and diapers together on the shelves, which increased sales of both further. Beer and diapers seem unrelated, yet under certain conditions there is a close relationship between them, and uncovering such relationships is what data mining does.

Author: wangzheng3056 Time: 2013-8-24 10:46
(2) The concept of data mining

Data mining is the process of identifying, from vast amounts of raw data, knowledge and information that is implicit in the data, previously unknown, and potentially significant, so that this knowledge can guide our activities. From a statistical point of view, data mining can be seen as automated exploratory analysis of large, complex data sets by computer. With the rapid development of information technology, the amount of accumulated data has increased sharply, and data mining is the data processing technology that arose to meet this need.

2. Background of data mining applications in retail

Retail customer relationship management (CRM) is a customer-centric marketing concept and strategy. The objectives of CRM are to shorten the sales cycle and reduce marketing costs, increase revenue, find the new markets and channels needed to expand the business, and enhance customer value, satisfaction, profitability, and loyalty. Retail CRM gathers large amounts of data on products, customers, suppliers, and stores, mainly through bar codes, sales management systems, and customer data management systems. How to use this mass of data to work out which items sell and which do not, which customers suit which products, and how products should be combined is a headache for retailers. Analyzing these data with data mining tools can help retailers make decisions scientifically: analyzing which products customers are most likely to purchase together, so that these products can be placed together; analyzing product sales trends, to give retailers purchasing recommendations; analyzing information about the people who purchase goods, to help retailers choose store locations; and so on.

Author: wangzheng3056 Time: 2013-8-24 10:46
3. Algorithms commonly used in data mining

Data mining is the core technology in retail CRM. By analyzing the goods customers have purchased and the intrinsic links between those goods, it determines customers' buying habits and their tendencies toward associated purchases, helping retailers develop marketing strategies. To apply CRM in the retail sector, data mining mainly relies on the following commonly used algorithms:

Author: wangzheng3056 Time: 2013-8-24 10:46
(1) Clustering analysis algorithms

Clustering analysis groups or classifies things according to their characteristics, as in the saying "birds of a feather flock together", in order to discover regularities and typical patterns. In retail, cluster analysis can help market analysts distinguish different consumer groups in a consumer database and summarize the spending patterns or habits of each group.

Author: wangzheng3056 Time: 2013-8-24 10:47
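A minimal sketch of the clustering idea described above. The post does not name a specific method, so k-means is assumed here; the `kmeans` function, the starting centroids, and the customer figures are all hypothetical.

```python
from math import dist

def kmeans(points, centroids, iterations=10):
    """Assign each point to its nearest centroid, then move centroids to cluster means."""
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Recompute each centroid as the mean of its cluster; keep it if the cluster is empty.
        centroids = [
            tuple(sum(axis) / len(cluster) for axis in zip(*cluster)) if cluster else centroids[i]
            for i, cluster in enumerate(clusters)
        ]
    return centroids, clusters

# Hypothetical customers: (store visits per month, average basket value)
customers = [(1, 20), (2, 25), (1, 22), (8, 90), (9, 95), (10, 88)]
centroids, clusters = kmeans(customers, centroids=[customers[0], customers[3]])

for centre, members in zip(centroids, clusters):
    print(centre, members)
```

Each resulting cluster can then be summarized (mean visits, mean basket value) to describe the spending habits of that consumer group. In practice the starting centroids are usually chosen at random and the features are scaled first.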
(2) Decision tree algorithms

A decision tree algorithm uses the training set to generate a test function and creates a branch of the tree for each of its values. Within the subset belonging to each branch it repeats the process to create lower-level nodes and branches, thus generating a decision tree. The tree is then pruned and finally converted into rules. Decision trees are commonly used in predictive models, which classify large amounts of data in order to find valuable latent information. Classification is fast, which makes the method particularly suitable for large-scale data.

Author: wangzheng3056 Time: 2013-8-24 10:47
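The tree-building steps described above can be sketched roughly as follows. This assumes an ID3-style split on information gain, which the post does not specify, and it leaves out pruning. The function names and the training rows are hypothetical.

```python
from collections import Counter
from math import log2

def entropy(labels):
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())

def information_gain(rows, attribute, target):
    """Reduction in entropy of the target after splitting on one attribute."""
    remainder = 0.0
    for value in {r[attribute] for r in rows}:
        subset = [r[target] for r in rows if r[attribute] == value]
        remainder += len(subset) / len(rows) * entropy(subset)
    return entropy([r[target] for r in rows]) - remainder

def build_tree(rows, attributes, target):
    labels = [r[target] for r in rows]
    if len(set(labels)) == 1 or not attributes:
        return Counter(labels).most_common(1)[0][0]  # leaf: majority class
    best = max(attributes, key=lambda a: information_gain(rows, a, target))
    rest = [a for a in attributes if a != best]
    # One branch per value of the chosen attribute, built recursively on that subset.
    return {best: {v: build_tree([r for r in rows if r[best] == v], rest, target)
                   for v in {r[best] for r in rows}}}

def to_rules(tree, conditions=()):
    """Flatten the tree into (conditions, predicted class) rules."""
    if not isinstance(tree, dict):
        return [(conditions, tree)]
    (attribute, branches), = tree.items()
    rules = []
    for value, subtree in branches.items():
        rules += to_rules(subtree, conditions + ((attribute, value),))
    return rules

# Hypothetical training set
rows = [
    {"has_children": "yes", "weekend": "yes", "buys_diapers": "yes"},
    {"has_children": "yes", "weekend": "no", "buys_diapers": "yes"},
    {"has_children": "no", "weekend": "yes", "buys_diapers": "no"},
    {"has_children": "no", "weekend": "no", "buys_diapers": "no"},
]
tree = build_tree(rows, ["has_children", "weekend"], "buys_diapers")
rules = to_rules(tree)
print(tree)
```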

(3) Neural network algorithms

Neural network algorithms simulate the function of human neurons, passing data through an input layer, hidden layers, and an output layer to compute the final result. Their advantage is that they can make accurate predictions on complex problems, and they are robust, adaptive, and highly fault-tolerant.

Author: wangzheng3056 Time: 2013-8-24 10:47
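To make the layer structure described above concrete, here is a small sketch of a single forward pass through one hidden layer. The sigmoid activation, the layer sizes, and every weight value are assumptions for illustration only. Training, that is, adjusting the weights from data, is not shown.

```python
from math import exp

def sigmoid(x):
    return 1.0 / (1.0 + exp(-x))

def layer(inputs, weights, biases):
    """One fully connected layer: weighted sum per neuron, then sigmoid."""
    return [
        sigmoid(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]

def forward(inputs, hidden_w, hidden_b, output_w, output_b):
    hidden = layer(inputs, hidden_w, hidden_b)   # input layer -> hidden layer
    return layer(hidden, output_w, output_b)     # hidden layer -> output layer

# Hypothetical network: 2 inputs, 2 hidden neurons, 1 output
hidden_w = [[0.5, -0.2], [0.1, 0.4]]
hidden_b = [0.0, -0.1]
output_w = [[0.7, -0.3]]
output_b = [0.05]

score = forward([1.0, 2.0], hidden_w, hidden_b, output_w, output_b)
print(score)
```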
(4) Association rule mining algorithms

Association rule mining is used to detect relationships among attributes in a database. The essence of association rule discovery is to find strong association rules in the database and use them to understand customer behavior. The most typical example is market basket analysis.

Author: wangzheng3056 Time: 2013-8-24 10:47
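As a rough illustration of market basket analysis, the sketch below counts how often single items and item pairs appear across baskets and keeps those above a minimum support. It is a brute-force count rather than a full Apriori implementation, and the `frequent_itemsets` function and the baskets are hypothetical.

```python
from collections import Counter
from itertools import combinations

def frequent_itemsets(transactions, min_support, max_size=2):
    """Return {itemset: support} for itemsets of up to max_size items."""
    n = len(transactions)
    result = {}
    for size in range(1, max_size + 1):
        counts = Counter(
            itemset
            for basket in transactions
            for itemset in combinations(sorted(set(basket)), size)
        )
        for itemset, count in counts.items():
            if count / n >= min_support:
                result[itemset] = count / n
    return result

# Hypothetical baskets
baskets = [
    ["beer", "diapers", "chips"],
    ["beer", "diapers"],
    ["milk", "diapers"],
    ["beer", "chips"],
    ["milk", "bread"],
]
frequent = frequent_itemsets(baskets, min_support=0.4)
print(frequent)
```

Item pairs that survive the support threshold are the candidates from which strong rules are then derived.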
4. Applications of data mining in retail

With the rise of the Web and e-commerce, retail CRM has become a main application area of data mining. Data mining can help identify customer buying behavior, discover customer buying patterns and trends, improve service quality, achieve better customer retention and satisfaction, improve the sales ratio of goods, and design better transportation and distribution strategies to reduce business costs. The application of data mining to CRM in the retail industry is mainly reflected in the following areas:

Author: wangzheng3056 Time: 2013-8-24 10:47
(1) Using multi-feature data cubes for multi-dimensional analysis of sales, customers, products, time, and regions

Multidimensional data analysis analyzes, queries, and reports on data along multiple dimensions. A dimension is a particular perspective from which data are observed. For example, when a company examines product sales, it usually looks at the sales of different products from the perspectives of customer, product, time, and region. Here customer, product, time, and region are the dimensions. By studying different combinations of these dimensions and the associated measures, different customer groups can be found in the customer database, so that decision-makers can make ordering, sales, and service decisions according to the characteristics of the main customer groups.

Author: wangzheng3056 Time: 2013-8-24 10:48
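The idea of viewing one measure along different combinations of dimensions can be sketched as a tiny in-memory data cube. This is only an illustration: the `data_cube` function, the dimension names, and the sales records are hypothetical, and a real data warehouse would do this with an OLAP engine or SQL grouping.

```python
from collections import defaultdict
from itertools import combinations

def data_cube(records, dimensions, measure):
    """Total the measure for every combination of dimensions (every cuboid)."""
    cube = {}
    for size in range(len(dimensions) + 1):
        for dims in combinations(dimensions, size):
            totals = defaultdict(float)
            for r in records:
                totals[tuple(r[d] for d in dims)] += r[measure]
            cube[dims] = dict(totals)
    return cube

# Hypothetical sales records
sales = [
    {"product": "beer", "region": "east", "month": "Jan", "amount": 100.0},
    {"product": "beer", "region": "west", "month": "Jan", "amount": 50.0},
    {"product": "diapers", "region": "east", "month": "Jan", "amount": 80.0},
    {"product": "diapers", "region": "east", "month": "Feb", "amount": 70.0},
]
cube = data_cube(sales, ["product", "region", "month"], "amount")

print(cube[()])                      # grand total
print(cube[("product",)])            # sales by product
print(cube[("product", "region")])   # sales by product and region
```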
(2) Using association analysis to mine association information for purchase recommendations and product references

Association analysis uses association rule mining to discover hidden relationships among the data in a database, in the form of knowledge such as "90% of customers who purchase product A in a transaction also purchase product B". By mining association information from sales records, one can find which other products a customer who buys a particular brand is likely to purchase. Such information can be used to form purchase recommendations. By promoting these recommendations, businesses can improve service, help customers choose products, increase sales, and reduce inventory.

Author: wangzheng3056 Time: 2013-8-24 10:48
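A statement like "90% of customers who purchase A also purchase B" is the confidence of the rule A -> B. The sketch below computes support, confidence, and lift for one such rule. The `rule_metrics` function and the transaction counts are hypothetical, chosen so that the confidence comes out at 90%.

```python
def rule_metrics(transactions, antecedent, consequent):
    """Support, confidence, and lift of the rule antecedent -> consequent."""
    baskets = [set(t) for t in transactions]
    n = len(baskets)
    has_a = sum(1 for b in baskets if antecedent in b)
    has_b = sum(1 for b in baskets if consequent in b)
    has_both = sum(1 for b in baskets if antecedent in b and consequent in b)
    support = has_both / n                                # share of all baskets with A and B
    confidence = has_both / has_a if has_a else 0.0       # share of A-baskets that also hold B
    lift = confidence / (has_b / n) if has_b else 0.0     # > 1 means A raises the chance of B
    return support, confidence, lift

# Hypothetical transactions: 10 contain A, 9 of those also contain B
transactions = [["A", "B"]] * 9 + [["A"]] + [["B"]] * 3 + [["C"]] * 7
support, confidence, lift = rule_metrics(transactions, "A", "B")
print(support, confidence, lift)
```

A rule with high confidence and a lift above 1 is a reasonable candidate for a "customers who bought A may also want B" recommendation.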
(3) Using multi-dimensional analysis and association analysis to analyze the effectiveness of promotional activities

Multi-dimensional analysis and association analysis can examine data from the database to analyze customer buying habits, the success rate of advertising, and other strategic information. Using sales data from recent years, multi-dimensional association analysis compares sales volume and the number of transactions during a promotion with the situation before and after it. This makes it possible to predict seasonal and monthly sales and to analyze trends in the variety of goods and in inventory. It can also help decide which goods to mark down, in what quantities, and how to run the promotion. Moreover, association analysis can find out which products are suitable for promotional activities, which makes it easier to arrange the supply of goods and improve sales.
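The before/during/after comparison can be sketched very simply. This is a deliberately reduced illustration that only compares mean daily sales for one product. The `promotion_effect` function, the day indexes, and the sales figures are hypothetical, and a real analysis would also control for seasonality.

```python
from statistics import mean

def promotion_effect(daily_sales, start, end):
    """Compare mean daily sales before, during, and after days [start, end)."""
    before = mean(daily_sales[:start])
    during = mean(daily_sales[start:end])
    after = mean(daily_sales[end:])
    return {
        "before": before,
        "during": during,
        "after": after,
        "uplift": (during - before) / before,  # relative change during the promotion
    }

# Hypothetical daily unit sales; the promotion runs on days 3, 4 and 5
daily_sales = [100, 110, 90, 150, 160, 140, 95, 105]
effect = promotion_effect(daily_sales, start=3, end=6)
print(effect)
```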

Author: wangzheng3056 Time: 2013-8-24 10:48
(4) Sequential pattern mining for customer loyalty analysis

Sequential pattern analysis is similar to association analysis, but it focuses on the before-and-after ordering relationships in the data. Sequential pattern mining can be used to analyze changes in customer loyalty and consumption, so that prices and the variety of goods can be adjusted accordingly to retain existing customers and attract new ones, keeping the customer base at a certain size. A merchant can analyze the characteristics of customers who were originally its own but later switched to competitors, apply the results to existing customer data to identify customers who may switch, and then devise ways to prevent their loss. Customers can also be ranked by loyalty according to their consumption behavior and transaction records, so that different strategies can be adopted according to the level of attrition risk.

Author: wangzheng3056 Time: 2013-8-24 10:48
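To show how sequential patterns differ from plain associations, the sketch below counts ordered pairs ("X was bought, and Y was bought some time later") across customer purchase histories. It only handles patterns of length two, and the `sequential_pairs` function and the histories are hypothetical.

```python
from collections import Counter

def sequential_pairs(sequences, min_support):
    """Return {(earlier, later): support} for ordered item pairs."""
    counts = Counter()
    for seq in sequences:
        seen_pairs = set()
        for i, first in enumerate(seq):
            for later in seq[i + 1:]:
                if first != later:
                    seen_pairs.add((first, later))
        counts.update(seen_pairs)  # count each pattern at most once per customer
    n = len(sequences)
    return {pair: c / n for pair, c in counts.items() if c / n >= min_support}

# Hypothetical purchase histories, one list per customer, in time order
histories = [
    ["crib", "diapers", "toys"],
    ["diapers", "toys"],
    ["crib", "toys"],
    ["toys", "diapers"],
]
patterns = sequential_pairs(histories, min_support=0.5)
print(patterns)
```

Unlike an association rule, ("diapers", "toys") and ("toys", "diapers") are different patterns here, because the order of purchase matters.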
(5) Using cross-selling models to sell new products or services to existing customers

The relationship between a retailer and its customers is ongoing and evolving, and cross-selling is the process of selling new products or services to existing customers. Cross-selling is based on mutual benefit for buyer and seller: customers benefit from receiving more and better services that meet their needs, and the business benefits from the resulting sales growth. An advantage of cross-selling is that the business can more easily obtain rich information about its customers. The customer information a business holds, especially information on previous purchases, may contain the key factors that determine a customer's next purchase. This is where data mining comes in: it can help businesses find the information and factors that influence customer buying behavior.

Author: WXYINHIT Time: 2014-1-9 17:26
Nice post, bumping and upvoting. I don't really understand it, but it sounds impressive.