| |
BIG DATA, LOW LATENCY
"The typical CIO understands the security and privacy implications of the growing digital universe but is not sure how to get the rest of the company to understand them" - IDC/EMC
"The sheer size of today’s data banks means that companies need to be more careful than ever to treat data as a slave rather than a master." - Economist
| |
| THE STATE OF BIG DATA > |
| 2.5 QUINTILLION |
- 2.5 quintillion bytes is the amount of data we are producing every day [4]
|
| $6 TRILLION |
|
| $650 BILLION |
- $650 Billion / year: cost of wasted productivity because of Information overload.
|
| $6.65 MILLION |
- 2008 average organizational cost of data breach [2]
|
| $202 |
- Estimated cost per customer record compromised in a security breach to companies (2008-2009 figure) [2]
|
| 1ZB |
- 1 Zettabyte: Estimated Internet Traffic by 2015 according to one study [3]
|
| 1800EB |
- 1,800 Exabytes: Size of the digital universe in 2011 [1]
|
| 281EB |
|
| 90% |
- 90% of the data in the world today has been created in the last two years alone [4]
|
| 85% |
- enterprises are
- responsible for the security, privacy, reliability, and
- compliance of 85%
|
| 70% |
- 70% of digital universe is created by individuals [1]
- enterprises are responsible for the security, privacy, reliability, and compliance of 85%
|
| 18 |
- 18 Months is the estimated time for the digital universe to double [1]
|
| 10x |
- Growth expected in the size of the digital universe between 2006-2011 [1]
|
| <100th |
- In 2007, the size of digital universe was less than a hundredth of Avogadro’s constant (602,200,000,000,000,000,000,000)
- Avogadro’s constant is the number of carbon atoms in 12 grams
|
| SRC |
|
|
BIG DATA BLOGS > |
|
|
|
| BIG DATA PROJECTS > |
- MapReduce Online PDF by Yahoo!: For real-time streaming with Hadoop
- S4 by Yahoo!: a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform ()
- Karmasphere: Develop, Debug and Monitor Haddop big data analytics.
|
|
| BIG DATA STARTUPS> |
|
| BIG DATA RESEARCH PAPERS > |
|
|
| |
|
| |
| BIG DATA INTRODUCTION/ARTICLES > |
- Big data | Wikipedia
- Big data innovation
- Cassandra
- Data Scientist
- Eventual Consistency
- Related
|
|
| BIG DATA PLATFORMS > |
- EC2 - Elastic Compute Cloud
- Elastic MapReduce
- Google App Engine
|
| BIG DATA TECHNOLOGIES > |
|
GRAPH DATABASES
- AllegroGraph
- Angrapa
- Bigdata
- CloudGraph
- Cytoscape
- DEX
- Filament
- FlockDB
- Giraph
- GoldenOrb
- GraphBase
- Graphd
- Horton
- HyperGraphDB
- InfiniteGraph
- InfoGrid
- Neo4j
- OrientDB
- Phoebus
- Pregel
- sones GraphDB
- Trinity
- VertexDB
|
| BIG DATA PLATFORMS > |
- EC2 - Elastic Compute Cloud
- Elastic MapReduce
- Google App Engine
|
| BIG DATA RESEARCH PAPERS > |
|
|
|
© 2010-2011 big data, low latency
|