Distributed storage cluster and Hadoop

  1. What is big data?
Big data problems
  • -> Even if a vendor could build an appliance large enough to store such big data, a second problem appears: velocity (I/O). The system becomes very slow, both when storing the data (output) and when reading it back (input).
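To get a feel for the velocity problem, here is a rough back-of-the-envelope sketch in Python. The numbers (10 TB of data, a single disk sustaining 100 MB/s) are illustrative assumptions, not measurements, and `transfer_hours` is a hypothetical helper name.

```python
# Illustrative values only: both numbers below are assumptions.
DATA_SIZE_GB = 10_000        # 10 TB of data held on one appliance
THROUGHPUT_MB_PER_S = 100    # sustained I/O speed of that appliance


def transfer_hours(data_gb: float, mb_per_s: float) -> float:
    """Hours needed to write (or read) data_gb gigabytes once."""
    seconds = (data_gb * 1000) / mb_per_s  # 1 GB taken as 1000 MB
    return seconds / 3600


print(round(transfer_hours(DATA_SIZE_GB, THROUGHPUT_MB_PER_S), 1))  # 27.8
```

Under these assumptions a single pass over the data takes more than a day, even though the appliance is big enough to hold it.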
8 v’s of big data
  • -> The two major big data problems are:
  • volume (size) and velocity (I/O)
  • Volume is the major issue: it arises when the data is larger than the storage capacity of the appliance.
  • Velocity: we always want to upload and retrieve data quickly, but with big data this process becomes very slow.
  • -> With distributed storage, we split the file into portions and store each portion on a different storage node.
  • -> It uses a master-slave topology: one system or server acts as the master and is connected to several slaves, which contribute their storage to the master. The client contacts the master directly to store data.
  • -> In Hadoop, the master is called the NameNode and the slaves are called DataNodes.
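The split-and-distribute idea described above can be sketched in a few lines of Python. This is a toy model, not Hadoop's implementation: the `NameNode` and `DataNode` classes, the 4-byte block size, and the round-robin placement are all assumptions made for illustration, and there is no replication.

```python
from dataclasses import dataclass, field


@dataclass
class DataNode:
    """Hypothetical slave: contributes its storage to the cluster."""
    name: str
    blocks: dict = field(default_factory=dict)  # block_id -> bytes

    def store(self, block_id: str, data: bytes) -> None:
        self.blocks[block_id] = data


@dataclass
class NameNode:
    """Hypothetical master: records which slave holds which block."""
    datanodes: list
    block_size: int = 4  # bytes; tiny on purpose so the split is visible
    index: dict = field(default_factory=dict)  # filename -> [(block_id, node name)]

    def put(self, filename: str, data: bytes) -> None:
        placements = []
        for start in range(0, len(data), self.block_size):
            block_no = start // self.block_size
            block_id = f"{filename}#{block_no}"
            # Round-robin placement is an assumption of this sketch.
            node = self.datanodes[block_no % len(self.datanodes)]
            node.store(block_id, data[start:start + self.block_size])
            placements.append((block_id, node.name))
        self.index[filename] = placements

    def get(self, filename: str) -> bytes:
        # Reassemble the file by fetching blocks in their recorded order.
        by_name = {n.name: n for n in self.datanodes}
        return b"".join(
            by_name[node].blocks[block_id]
            for block_id, node in self.index[filename]
        )


nodes = [DataNode("dn1"), DataNode("dn2"), DataNode("dn3")]
master = NameNode(nodes)
master.put("log.txt", b"hello big data")
print(master.index["log.txt"])
print(master.get("log.txt"))  # b'hello big data'
```

The 14-byte file is cut into four blocks spread over three slaves, and the master keeps only the bookkeeping needed to put them back together. In actual HDFS the client asks the NameNode where to write and then sends the block data to the DataNodes itself; the sketch routes everything through the master only to stay short.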




Abhishek kumar
