New Sources of Data
1 Source of large data
Traditional Data Warehousing
Volume, Velocity, and Variety
Aggregating Data from Different Sources
The challenge for most organizations is to manage and analyze the various sources of structured, structured, and streaming data.
New Trends in Data Organization
Business Analytics (Definition)
Business analytics makes extensive use of statistical analysis, including explanatory and predictive modeling, and fact-based management to drive decision making. It is therefore closely related to management science. Analytics may be used as input for human decisions or may drive fully automated decisions.
Descriptive Analytics
What has occurred?
How much did I sell?
BI, Data engineering, statistics …
Data Engineering and Statistics:
Organize data, execute large queries, describe means, trends, and test hypotheses
Predictive Analytics
What will occur?
Try to understand behaviour. E.g. switching customers
Data Mining and Econometrics
Forecast events, predict time series, or discrete choice decisions of customers
Prescriptive Analytics
What should occur?
Network flow, Management science …
Algorithms and Optimization
Develop algorithms and optimization models for planning, scheduling,
pricing, and revenue mgt.
Relationship to Business Intelligence (BA related to predictive / inductive statistics and BI related to descriptive analytics / statistics)
performance based on data and statistical methods.
* may be used as input for human decisions or may drive fully automated decisions. * Business intelligence (related to descriptive analytics / statistics) * traditionally focuses on using a consistent set of metrics to both measure past performance and guide business planning, which is also based on data and statistical methods. * is often associated with querying, reporting, OLAP, and "alerts".
From Data to Information (Flow)
Predictive Analytics
Numerical prediction
Given a collection of data with known numeric outputs, create a function that outputs a predicted value from a new set of inputs.
E.g. Given gestation time of an animal, predict its maximum life span.
Classification
Clustering
Difference to classification: you do not know the groups
Association Rule Analysis
Market basket analysis (milk, sugar, eggs)
What is a Model?
Mathematical functions
E.g.:
Model selection
Most important algorithms
Examples of Analytics in Retailing
CRM Marketing and examples