-
Thông tin
-
Hỏi đáp
(Big-)Data Architecture (Re-)Invented| Tài liệu tham khảo môn quản trị dữ liệu và trực quan hóa| Trường Đại học Bách Khoa Hà Nội
What is Big Data?
• A collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications
• Due to its technical nature, the same challenges arise in Analytics at much lower volumes than what is traditionally considered Big Data.
Môn: Quản trị dữ liệu và trực quan hóa
Trường: Đại học Bách Khoa Hà Nội
Thông tin:
Tác giả:
Preview text:
(Big-)Data Architecture (Re-)Invented
Part 1: Hadoop and Data Lake William El Kaim May 2018 – V 4.0
This Presentation is part of the
Enterprise Architecture Digital Codex http://www.eacodex.com/
Copyright © William El Kaim 2018 2 • Taming The Data Deluge • What is Big Data? • Why Now? • What is Hadoop?
• What is Hadoop Data Lake? • When to use Hadoop?
• Getting Started with Big Data
Copyright © William El Kaim 2018 3
Taming the Data Deluge (2017)
Copyright © William El Kaim 2018 Source: LuceMedia 4 Taming the Data Deluge
Copyright © William El Kaim 2018 5 Taming the Data Deluge
Copyright © William El Kaim 2018 6 • Taming The Data Deluge • What is Big Data? • Why Now? • What is Hadoop?
• What is Hadoop Data Lake? • When to use Hadoop?
• Getting Started with Big Data
Copyright © William El Kaim 2018 7 What is Big Data?
• A collection of data sets so large and complex that it becomes difficult to
process using on-hand database management tools or traditional data processing applications
• Due to its technical nature, the same challenges arise in Analytics at much lower
volumes than what is traditionally considered Big Data. • Other definitions
• When the data could not fit in Excel
• Used to be 65,536 lines, Now 1,048,577 lines
• When it's cheaper to keep everything than spend the effort to decide what to throw away (David Brower @dbrower)
Copyright © William El Kaim 2018 Source: SiSense 8 What is Big Data? 6 Visualization
Copyright © William El Kaim 2018 Source: James Higginbotham 9 The “Vs” to Nirvana
Copyright © William El Kaim 2018 Source: Bernard Marr 10 The “Vs” to Nirvana
Copyright © William El Kaim 2018 Source: IBM 11 The “Vs” to Nirvana
Copyright © William El Kaim 2018 Source: IBM 12 The “Vs” to Nirvana
Copyright © William El Kaim 2018 Source: IBM 13 The “Vs” to Nirvana
Copyright © William El Kaim 2018 Source: IBM 14 The “Vs” to Nirvana Visualization
Copyright © William El Kaim 2018 Source: Bernard Marr 15 The “Vs” to Nirvana
Copyright © William El Kaim 2018 Source: M-Brain 16
“Big Data” Landscape
Copyright © William El Kaim 2018 17 • Taming The Data Deluge • What is Big Data? • Why Now? • What is Hadoop?
• What is Hadoop Data Lake? • When to use Hadoop?
• Getting Started with Big Data
Copyright © William El Kaim 2018 18 Why Now?
Copyright © William El Kaim 2018 19
Why Now? Datafication of the World Source: Bernard Marr
Copyright © William El Kaim 2018 20