Organization Prerequisites
Data Procurement
means acquiring data, whether from internal or external sources
Privacy
Datasets that look harmless on their own can reveal private information when they are analyzed jointly.
telemetry data
data that devices send automatically, e.g. car diagnostics, Fitbit
Security
Securing Big Data involves ensuring that the data networks and repositories are sufficiently secured via
authentication and authorization mechanisms.
Provenance
Limited Realtime Support
Approaches that achieve near-realtime
results often process transactional data as it arrives and combine it with previously summarized batch-processed data.
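A minimal sketch of the near-realtime idea above: totals already summarized by a batch job are combined with transactions that arrived afterwards. The function name `merge_views`, the account IDs, and the amounts are all hypothetical, chosen only for illustration.

```python
from collections import defaultdict


def merge_views(batch_totals, recent_events):
    """Combine batch-summarized totals with events that arrived since the batch ran."""
    # Start from the batch summary; unseen accounts default to 0.0.
    combined = defaultdict(float, batch_totals)
    for account, amount in recent_events:
        combined[account] += amount
    return dict(combined)


# Hypothetical data: totals from an earlier batch run, plus newly arrived transactions.
batch_totals = {"acct-1": 100.0, "acct-2": 50.0}
recent_events = [("acct-1", 25.0), ("acct-3", 10.0)]

current_view = merge_views(batch_totals, recent_events)
print(current_view)
```

The batch summary is never recomputed here; only the small set of recent events is processed on arrival, which is what keeps the result close to realtime.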
Distinct Performance Challenges
Distinct Governance Requirements
Distinct Methodology
A methodology will be required to control how data flows into and out of Big Data solutions
Clouds
Clouds provide remote environments that can host IT
infrastructure for large-scale storage and processing, among other things.
Big Data analytics lifecycle
At different stages in the analytics lifecycle,
data will be in one of three states:
data-in-motion (transmitted)
data-in-use (processed)
data-at-rest (storage)
Business Case Evaluation
Data Identification
Identifying a wider variety of data sources
Data Acquisition & Filtering
Data Extraction
is dedicated to extracting disparate data and transforming it
into a format that the underlying Big Data solution can use
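As a rough illustration of extraction, the sketch below pulls the same two fields out of a JSON record and an XML record and emits one common format. The field names (`id`, `text`, `user_id`, `comment`) and the sample inputs are assumptions made up for this example.

```python
import json
import xml.etree.ElementTree as ET


def from_json(text):
    """Extract the fields of interest from a JSON document."""
    record = json.loads(text)
    return {"user_id": str(record["id"]), "comment": record["text"]}


def from_xml(text):
    """Extract the same fields from an XML document."""
    root = ET.fromstring(text)
    return {"user_id": root.findtext("id"), "comment": root.findtext("text")}


# Hypothetical inputs arriving in two different formats.
json_source = '{"id": 17, "text": "great service"}'
xml_source = "<post><id>42</id><text>slow delivery</text></post>"

# Both sources end up in one uniform structure.
records = [from_json(json_source), from_xml(xml_source)]
print(records)
```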
Data Validation & Cleansing
is dedicated to establishing often complex validation rules and removing any known invalid data
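A small sketch of validation and cleansing: each record is checked against a set of rules, and records that fail are removed. The rules here (age between 0 and 120, email containing "@") are hypothetical; real rules are usually far more complex.

```python
def is_valid(record):
    """Hypothetical rules: age is an integer in 0..120 and email contains '@'."""
    age = record.get("age")
    email = record.get("email") or ""
    return isinstance(age, int) and 0 <= age <= 120 and "@" in email


# Hypothetical raw input containing two invalid records.
raw = [
    {"age": 34, "email": "a@example.com"},
    {"age": -5, "email": "b@example.com"},   # fails the age rule
    {"age": 51, "email": "not-an-email"},    # fails the email rule
]

# Keep only the records that pass every rule.
clean = [r for r in raw if is_valid(r)]
print(clean)
```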
Data Aggregation & Representation
is dedicated to integrating multiple datasets together using common fields
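Integrating datasets on a common field can be sketched as a simple inner join. The helper `join_on`, the `customer_id` field, and the sample rows are hypothetical.

```python
def join_on(left, right, key):
    """Inner-join two lists of dicts on a shared field."""
    # Index the right-hand dataset by the common field.
    index = {}
    for row in right:
        index.setdefault(row[key], []).append(row)
    # Pair each left-hand row with every matching right-hand row.
    joined = []
    for row in left:
        for match in index.get(row[key], []):
            joined.append({**row, **match})
    return joined


# Hypothetical datasets sharing the field "customer_id".
customers = [{"customer_id": 1, "name": "Ana"}, {"customer_id": 2, "name": "Ben"}]
orders = [{"customer_id": 1, "total": 30.0}, {"customer_id": 1, "total": 12.5}]

merged = join_on(customers, orders, "customer_id")
print(merged)
```

Customer 2 has no orders, so it does not appear in the joined result; an outer join would be the alternative if unmatched rows need to be kept.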
Confirmatory data analysis
is a deductive approach in which the cause of the phenomenon being investigated is proposed beforehand.
The proposed cause or assumption is called a hypothesis.
The data is then analyzed to prove or disprove the hypothesis and provide definitive answers to specific questions
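One way to picture confirmatory analysis is a small hypothesis test. The sketch below uses an exact permutation test, which is just one possible technique and is not named in these notes. The hypothesis (a new page layout increases session time) and all numbers are invented for illustration.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(group_a, group_b):
    """One-sided exact permutation test on the difference in means (b minus a)."""
    pooled = group_a + group_b
    observed = mean(group_b) - mean(group_a)
    indices = range(len(pooled))
    count = total = 0
    # Enumerate every way of relabeling the pooled values into the two groups.
    for chosen in combinations(indices, len(group_b)):
        chosen_set = set(chosen)
        b = [pooled[i] for i in chosen]
        a = [pooled[i] for i in indices if i not in chosen_set]
        if mean(b) - mean(a) >= observed:
            count += 1
        total += 1
    return count / total


# Hypothetical session times (minutes) under the old and new layout.
old_layout = [10, 12, 11, 13]
new_layout = [15, 17, 16, 18]

p = permutation_p_value(old_layout, new_layout)
print(p)
```

A small p-value counts as evidence for the proposed cause; how small is small enough is a threshold the analyst has to pick before looking at the data.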
Exploratory data analysis
is an inductive approach that is closely associated with data mining.
No hypothesis or predetermined assumptions are generated.
It may not provide definitive answers.