I've attached the BDA PDF covering the updated data pipeline and visualization requirements. It includes: Scalability Metrics: Data storage and mining performance.
Applying quantitative modeling to solve real-world problems.
Instead of chasing a static PDF, try these resources: