(Big-)Data Architecture (Re-)Invented| Tài liệu tham khảo môn quản trị dữ liệu và trực quan hóa| Trường Đại học Bách Khoa Hà Nội

What is Big Data?
• A collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications
• Due to its technical nature, the same challenges arise in Analytics at much lower volumes than what is traditionally considered Big Data.

Thông tin:
85 trang 3 tháng trước

Bình luận

Vui lòng đăng nhập hoặc đăng ký để gửi bình luận.

(Big-)Data Architecture (Re-)Invented| Tài liệu tham khảo môn quản trị dữ liệu và trực quan hóa| Trường Đại học Bách Khoa Hà Nội

What is Big Data?
• A collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications
• Due to its technical nature, the same challenges arise in Analytics at much lower volumes than what is traditionally considered Big Data.

23 12 lượt tải Tải xuống
(Big-)Data Architecture (Re-)Invented
Part 1: Hadoop and Data Lake
William El Kaim
May 2018 V 4.0
This Presentation is part of the
Enterprise Architecture Digital Codex
http://www.eacodex.com/
2
Copyright © William El Kaim 2018
Taming The Data Deluge
What is Big Data?
Why Now?
What is Hadoop?
What is Hadoop Data Lake?
When to use Hadoop?
Getting Started with Big Data
3
Copyright © William El Kaim 2018
Taming the Data Deluge (2017)
Source: LuceMedia
4
Copyright © William El Kaim 2018
Taming the Data Deluge
5
Copyright © William El Kaim 2018
Taming the Data Deluge
6
Copyright © William El Kaim 2018
Taming The Data Deluge
What is Big Data?
Why Now?
What is Hadoop?
What is Hadoop Data Lake?
When to use Hadoop?
Getting Started with Big Data
7
Copyright © William El Kaim 2018
What is Big Data?
A collection of data sets so large and complex that it becomes difficult to
process using on-hand database management tools or traditional data
processing applications
Due to its technical nature, the same challenges arise in Analytics at much lower
volumes than what is traditionally considered Big Data.
Other definitions
When the data could not fit in Excel
Used to be 65,536 lines, Now 1,048,577 lines
When it's cheaper to keep everything than spend the effort to decide what to throw away
(David Brower @dbrower)
Copyright © William El Kaim 2018
8
Source: SiSense
What is Big Data?
Source: James Higginbotham
6
9
Copyright © William El Kaim 2018
Visualization
The “Vs” to Nirvana
Source: Bernard Marr
10
Copyright © William El Kaim 2018
The “Vs” to Nirvana
Source: IBM
11
Copyright © William El Kaim 2018
The “Vs” to Nirvana
Source: IBM
12
Copyright © William El Kaim 2018
The “Vs” to Nirvana
Source: IBM
13
Copyright © William El Kaim 2018
The “Vs” to Nirvana
Source: IBM
14
Copyright © William El Kaim 2018
The “Vs” to Nirvana
Source: Bernard Marr
Visualization
15
Copyright © William El Kaim 2018
The “Vs” to Nirvana
Source: M-Brain
16
Copyright © William El Kaim 2018
“Big Data” Landscape
Copyright © William El Kaim 2018
17
Taming The Data Deluge
What is Big Data?
Why Now?
What is Hadoop?
What is Hadoop Data Lake?
When to use Hadoop?
Getting Started with Big Data
18
Copyright © William El Kaim 2018
Why Now?
19
Copyright © William El Kaim 2018
Why Now? Datafication of the World
Source: Bernard Marr
20
Copyright © William El Kaim 2018
| 1/85

Preview text:

(Big-)Data Architecture (Re-)Invented
Part 1: Hadoop and Data Lake
William El Kaim May 2018 – V 4.0
This Presentation is part of the
Enterprise Architecture Digital Codex http://www.eacodex.com/
Copyright © William El Kaim 2018 2 • Taming The Data Deluge What is Big Data? Why Now? What is Hadoop?
What is Hadoop Data Lake? When to use Hadoop?
Getting Started with Big Data
Copyright © William El Kaim 2018 3
Taming the Data Deluge (2017)
Copyright © William El Kaim 2018 Source: LuceMedia 4 Taming the Data Deluge
Copyright © William El Kaim 2018 5 Taming the Data Deluge
Copyright © William El Kaim 2018 6 • Taming The Data Deluge What is Big Data? Why Now? What is Hadoop?
What is Hadoop Data Lake? When to use Hadoop?
Getting Started with Big Data
Copyright © William El Kaim 2018 7 What is Big Data?
• A collection of data sets so large and complex that it becomes difficult to
process using on-hand database management tools or traditional data processing applications
• Due to its technical nature, the same challenges arise in Analytics at much lower
volumes than what is traditionally considered Big Data. • Other definitions
• When the data could not fit in Excel
• Used to be 65,536 lines, Now 1,048,577 lines
• When it's cheaper to keep everything than spend the effort to decide what to throw away (David Brower @dbrower)
Copyright © William El Kaim 2018 Source: SiSense 8 What is Big Data? 6 Visualization
Copyright © William El Kaim 2018 Source: James Higginbotham 9 The “Vs” to Nirvana
Copyright © William El Kaim 2018 Source: Bernard Marr 10 The “Vs” to Nirvana
Copyright © William El Kaim 2018 Source: IBM 11 The “Vs” to Nirvana
Copyright © William El Kaim 2018 Source: IBM 12 The “Vs” to Nirvana
Copyright © William El Kaim 2018 Source: IBM 13 The “Vs” to Nirvana
Copyright © William El Kaim 2018 Source: IBM 14 The “Vs” to Nirvana Visualization
Copyright © William El Kaim 2018 Source: Bernard Marr 15 The “Vs” to Nirvana
Copyright © William El Kaim 2018 Source: M-Brain 16
“Big Data” Landscape
Copyright © William El Kaim 2018 17 • Taming The Data Deluge What is Big Data? Why Now? What is Hadoop?
What is Hadoop Data Lake? When to use Hadoop?
Getting Started with Big Data
Copyright © William El Kaim 2018 18 Why Now?
Copyright © William El Kaim 2018 19
Why Now? Datafication of the World Source: Bernard Marr
Copyright © William El Kaim 2018 20