NoSQL Data Modeling Techniques

The document discusses various distribution models for NoSQL databases, emphasizing the benefits and complexities of scaling out using clusters versus single-server setups. It covers techniques such as sharding, master-slave replication, and peer-to-peer replication, highlighting their advantages for read and write scalability. Additionally, it explains key-value stores, their features, suitable use cases, and scenarios where they may not be the best choice.

Uploaded by

kamisettysatyasatvik2005

Data Modelling with NoSQL Databases
Module-2
By Dr Shivakumar C
Distribution Models
• The primary driver of interest in NoSQL has been its ability to run databases
on a large cluster.
• As data volumes increase, it becomes more difficult and expensive to scale
up, that is, to buy a bigger server to run the database on.
• A more appealing option is to scale out: run the database on a cluster of
servers.
• Aggregate orientation fits well with scaling out because the aggregate is a
natural unit to use for distribution.
• Depending on your distribution model, you can get a data store that will
give you the ability to handle larger quantities of data, the ability to process
greater read or write traffic, or more availability in the face of network
slowdowns or breakages.
• These are often important benefits, but they come at a cost.
• Running over a cluster introduces complexity, so it’s not something to do
unless the benefits are compelling.
Single Server
• The first and simplest distribution option is the one we would most often
recommend: no distribution at all.
• Run the database on a single machine that handles all the reads and
writes to the data store.
• We prefer this option because it eliminates all the complexities that the other
options introduce; it’s easy for operations people to manage and easy for
application developers to reason about.
• We can use NoSQL with a single-server distribution model if the data model
of the NoSQL store is more suited to the application.
• Graph databases are the obvious category here; these work best in a
single-server configuration.
• If your data usage is mostly about processing aggregates, then a
single-server document or key-value store may well be worthwhile
because it’s easier on application developers.
Sharding
• Often, a busy data store is busy because different people are accessing
different parts of the dataset.
• In these circumstances we can support horizontal scalability by putting
different parts of the data onto different servers, a technique called
sharding.
• In the ideal case, different users all talk to different server nodes. Each user only has to talk to
one server, so gets rapid responses from that server.
• The load is balanced out nicely between servers.
• To get close to this ideal case, we have to ensure that data that’s accessed together is clumped
together on the same node and that these clumps are arranged on the nodes to provide the best data access.
• When sharding is handled in application logic, rebalancing the shards means changing the
application code and migrating the data.
• Many NoSQL databases offer auto-sharding, where the database takes on the responsibility of
allocating data to shards and ensuring that data access goes to the right shard.
• This can make it much easier to use sharding in an application.
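The routing step behind sharding can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the class name ShardRouter, the node names, and the choice of a SHA-256 hash with modulo placement are all hypothetical and are not taken from any particular database.

```python
import hashlib


class ShardRouter:
    """Route each key to one node using a stable hash (hypothetical helper)."""

    def __init__(self, nodes):
        if not nodes:
            raise ValueError("at least one node is required")
        self.nodes = list(nodes)

    def node_for(self, key):
        # hashlib yields the same digest in every process, unlike built-in hash().
        digest = hashlib.sha256(key.encode("utf-8")).digest()
        index = int.from_bytes(digest[:8], "big") % len(self.nodes)
        return self.nodes[index]


# Assumed node names and keys, for illustration only.
router = ShardRouter(["node-a", "node-b", "node-c"])
placement = {key: router.node_for(key) for key in ["user:1", "user:2", "order:17"]}
```

With modulo placement, changing the number of nodes moves most keys to a different node, which is one reason rebalancing implies migrating data. Auto-sharding databases hide this routing from the application.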
Master-Slave Replication
• With master-slave distribution, you replicate data across multiple nodes. One
node is designated as the master, or primary.
• This master is the authoritative source for the data and is usually
responsible for processing any updates to that data.
• The other nodes are slaves, or secondaries.
• A replication process synchronizes the slaves with the master.
• Master-slave replication is most helpful for scaling when you have a
read-intensive dataset.
• A second advantage of master-slave replication is read resilience: should
the master fail, the slaves can still handle read requests.
• Again, this is useful if most of your data access is reads. The failure of the
master does eliminate the ability to handle writes until either the master is
restored or a new master is appointed.
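The behavior described above (writes go through the master, reads survive a master failure, writes resume once a new master is appointed) can be sketched as follows. The classes Node and MasterSlaveCluster are hypothetical in-memory stand-ins, and replication is done synchronously only to keep the sketch short; real systems typically replicate with some lag.

```python
class Node:
    """One server holding a full copy of the data (in-memory stand-in)."""

    def __init__(self, name):
        self.name = name
        self.data = {}
        self.alive = True


class MasterSlaveCluster:
    def __init__(self, master, slaves):
        self.master = master
        self.slaves = list(slaves)

    def write(self, key, value):
        # All updates go through the master.
        if not self.master.alive:
            raise RuntimeError("no master available: writes are rejected")
        self.master.data[key] = value
        # Assumption: synchronous copy to every live slave.
        for slave in self.slaves:
            if slave.alive:
                slave.data[key] = value

    def read(self, key):
        # Any live slave can serve a read, even if the master is down.
        for slave in self.slaves:
            if slave.alive:
                return slave.data.get(key)
        if self.master.alive:
            return self.master.data.get(key)
        raise RuntimeError("no live node can serve the read")

    def promote(self):
        # Appoint the first live slave as the new master.
        for slave in self.slaves:
            if slave.alive:
                self.slaves.remove(slave)
                self.master = slave
                return slave
        raise RuntimeError("no live slave to promote")


cluster = MasterSlaveCluster(Node("m"), [Node("s1"), Node("s2")])
cluster.write("cart:42", ["book"])
cluster.master.alive = False                 # simulate master failure
value_after_failure = cluster.read("cart:42")
try:
    cluster.write("cart:42", ["book", "pen"])
    write_rejected = False
except RuntimeError:
    write_rejected = True
new_master = cluster.promote()               # writes work again after this
cluster.write("cart:42", ["book", "pen"])
```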
Peer-to-Peer Replication
• Master-slave replication helps with read scalability but doesn’t help with
scalability of writes.
• It provides resilience against failure of a slave, but not of a master.
• Essentially, the master is still a bottleneck and a single point of failure.
• Peer-to-peer replication attacks these problems by not having a master.
• All the replicas have equal weight: they can all accept writes, and the loss
of any of them doesn’t prevent access to the data store.
• With a peer-to-peer replication cluster, you can ride over node failures without
losing access to data.
• You can easily add nodes to improve performance.
• The biggest complication is, again, consistency.
• When you can write to two different places, you run the risk that two people will
attempt to update the same record at the same time: a write-write conflict.
• Inconsistencies on read lead to problems but at least they are relatively transient.
Inconsistent writes are forever.
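One way to at least detect a write-write conflict between peers is to attach a version vector to each value. The slides do not prescribe this technique; it is brought in here as an assumption, and the names Replica, descends, and compare are hypothetical. A minimal sketch:

```python
class Replica:
    """A peer that accepts writes locally and keeps a version vector per key."""

    def __init__(self, name):
        self.name = name
        self.store = {}  # key -> (value, {replica_name: counter})

    def write(self, key, value):
        _, clock = self.store.get(key, (None, {}))
        clock = dict(clock)  # copy so replicated tuples are never mutated
        clock[self.name] = clock.get(self.name, 0) + 1
        self.store[key] = (value, clock)


def descends(a, b):
    """True if clock a has seen every update recorded in clock b."""
    return all(a.get(node, 0) >= count for node, count in b.items())


def compare(replica_x, replica_y, key):
    """Classify the two copies as 'same', 'x_newer', 'y_newer' or 'conflict'."""
    _, cx = replica_x.store[key]
    _, cy = replica_y.store[key]
    if cx == cy:
        return "same"
    if descends(cx, cy):
        return "x_newer"
    if descends(cy, cx):
        return "y_newer"
    return "conflict"


a, b = Replica("a"), Replica("b")
a.write("room:101", "booked by Ann")
b.store["room:101"] = a.store["room:101"]       # replicate a's write to b
status_synced = compare(a, b, "room:101")
a.write("room:101", "booked by Bob")             # only a has this update
status_one_sided = compare(a, b, "room:101")
b.write("room:101", "booked by Cy")              # b updates without seeing Bob
status_concurrent = compare(a, b, "room:101")
```

Detection is only half the job: once a conflict is reported, something (the application, the user, or a merge policy) still has to decide which write wins.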
Combining Sharding and Replication
• Replication and sharding are strategies that can be combined.
• If we use both master-slave replication and sharding, we have multiple
masters, but each data item has only a single master.
• Depending on your configuration, you may choose a node to be a master for
some data and a slave for other data, or you may dedicate nodes to master or
slave duties.
• Using peer-to-peer replication and sharding is a common strategy for
column-family databases.
• In a scenario like this you might have tens or hundreds of nodes in a cluster
with data sharded over them.
• A good starting point for peer-to-peer replication is a replication factor
of 3, so each shard is present on three nodes. Should a node fail, the shards
on that node will be rebuilt on the other nodes.
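A replication factor of 3 and the rebuild after a node failure can be illustrated with a small placement sketch. The round-robin placement rule, the function names, and the node names are assumptions made for illustration; real databases use their own placement strategies.

```python
def place_shards(num_shards, nodes, replication_factor=3):
    """Assign each shard to replication_factor distinct nodes, round-robin."""
    if replication_factor > len(nodes):
        raise ValueError("replication factor cannot exceed the number of nodes")
    placement = {}
    for shard in range(num_shards):
        placement[shard] = [
            nodes[(shard + offset) % len(nodes)]
            for offset in range(replication_factor)
        ]
    return placement


def rebuild_after_failure(placement, nodes, failed):
    """Re-create each replica lost with the failed node on a surviving node."""
    survivors = [n for n in nodes if n != failed]
    repaired = {}
    for shard, holders in placement.items():
        kept = [n for n in holders if n != failed]
        if len(kept) < len(holders):
            # Pick the first survivor that does not already hold this shard.
            candidates = [n for n in survivors if n not in kept]
            if candidates:
                kept.append(candidates[0])
        repaired[shard] = kept
    return repaired


nodes = ["n1", "n2", "n3", "n4", "n5"]
placement = place_shards(5, nodes)
repaired = rebuild_after_failure(placement, nodes, "n2")
```

Always picking the first eligible survivor is the simplest rule but can load nodes unevenly; a production system would balance the rebuilt replicas.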
Key-Value Databases
• A key-value store is a simple hash table, primarily used when all access to
the database is via primary key.
• Example: think of a table in a traditional RDBMS with two columns, such as ID
and NAME, with the ID column being the key and the NAME column storing the
value. In an RDBMS, the NAME column is restricted to storing data of type
String.
What is a Key-Value Store
• Key-value stores are the simplest NoSQL data stores to use from an API
perspective.
• The client can either get the value for the key, put a value for a key, or
delete a key from the data store.
• Since key-value stores always use primary-key access, they generally have
great performance and can be easily scaled.
• Popular key-value databases include Riak, Redis, HamsterDB, Berkeley DB,
and Amazon DynamoDB.
• In some key-value stores, such as Redis, the aggregate being stored does
not have to be a domain object; it could be any data structure.
• Redis supports storing lists, sets, and hashes, and can do range, diff, union,
and intersection operations.
• These features allow Redis to be used in many more ways than a standard
key-value store.
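The three-operation API described above (get, put, delete) is small enough to sketch directly. The class KeyValueStore below is a hypothetical in-memory stand-in, not the client API of Riak, Redis, or any other product.

```python
class KeyValueStore:
    """In-memory stand-in for a key-value store: get, put, delete by key."""

    def __init__(self):
        self._data = {}

    def put(self, key, value):
        # The store does not interpret the value; it is an opaque blob.
        self._data[key] = value

    def get(self, key, default=None):
        return self._data.get(key, default)

    def delete(self, key):
        # Returns True if the key existed, False otherwise.
        existed = key in self._data
        self._data.pop(key, None)
        return existed


sessions = KeyValueStore()
sessions.put("session:abc", b'{"user": 7, "theme": "dark"}')
found = sessions.get("session:abc")
removed = sessions.delete("session:abc")
missing = sessions.get("session:abc")
```

Because every operation names exactly one key, the store never needs to look inside the value, which is what makes this model easy to distribute.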
Key-Value Store Features
• Consistency: Consistency is applicable only for operations on a single
key, since these operations are either a get, put, or delete on a single key.
• Transactions: Different key-value products have different transaction
specifications. Generally speaking, there are no guarantees on the writes,
although many data stores implement transactions in their own ways.
• Query Features: All key-value stores can query by the key, and that’s about
it. If you need to query by some attribute of the value, the database cannot
do it for you: the application has to read the value and check the attribute
itself.
• Structure of Data: Key-value databases don’t care what is stored in the
value part of the key-value pair. The value can be a blob, text, JSON, XML,
and so on.
• Scaling: Many key-value stores scale by using sharding. With sharding, the
value of the key determines on which node the key is stored.

• Note: A BLOB (Binary Large Object) is a large binary value such as an image,
video, or audio file.
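Since the database treats the value as opaque and can query only by key, a search on an attribute inside the value has to run in the application. The sketch below assumes the values are JSON text and uses a plain dict as a stand-in for the store; the function name, keys, and sample records are all made up for illustration.

```python
import json

# Stand-in for a key-value store: the values are opaque strings to the store.
store = {
    "user:1": json.dumps({"name": "Asha", "city": "Mysuru"}),
    "user:2": json.dumps({"name": "Ravi", "city": "Bengaluru"}),
    "user:3": json.dumps({"name": "Meera", "city": "Mysuru"}),
    "session:9": "not json at all",
}


def find_keys_by_attribute(store, attribute, wanted):
    """Client-side scan: fetch every value, parse it, and test the attribute."""
    matches = []
    for key, raw in store.items():
        try:
            doc = json.loads(raw)
        except (TypeError, ValueError):
            continue  # the value is not JSON, so it cannot match
        if isinstance(doc, dict) and doc.get(attribute) == wanted:
            matches.append(key)
    return matches


mysuru_users = find_keys_by_attribute(store, "city", "Mysuru")
```

The scan touches every key, so its cost grows with the size of the store rather than with the number of matches.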
Suitable Use Cases
• Storing Session Information
• User Profiles, Preferences
• Shopping Cart Data
When Not to Use
• Relationships among Data: If you need to have relationships between
different sets of data, or correlate the data between different sets of keys,
key-value stores are not the best solution to use, even though some key-value
stores provide link-walking features.
• Multioperation Transactions: If you’re saving multiple keys and there is a
failure to save any one of them, and you want to revert or roll back the rest
of the operations, key-value stores are not the best solution to use.
• Query by Data: If you need to search the keys based on something found in
the value part of the key-value pairs, then key-value stores are not going to
perform well for you.
• There is no way to inspect the value on the database side, with the exception
of some products like Riak Search or indexing engines like Lucene or Solr.
• Operations by Sets: Since operations are limited to one key at a time,
there is no way to operate upon multiple keys at the same time. If you need
to operate upon multiple keys, you have to handle this from the client side.
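To show what handling multiple keys from the client side involves, here is a sketch of a multi-key write that undoes its own partial work when one write fails. FlakyStore is a test double invented for this example, and put_all_or_nothing is a hypothetical helper, not a feature of any key-value product.

```python
class FlakyStore(dict):
    """Dict-backed stand-in whose writes fail for one chosen key (test double)."""

    def __init__(self, *args, fail_on=None, **kwargs):
        super().__init__(*args, **kwargs)
        self.fail_on = fail_on

    def __setitem__(self, key, value):
        if key == self.fail_on:
            raise IOError(f"simulated write failure for {key}")
        super().__setitem__(key, value)


def put_all_or_nothing(store, updates):
    """Write several keys; on failure, restore earlier values from the client."""
    absent = object()   # marks keys that did not exist before the write
    previous = {}
    try:
        for key, value in updates.items():
            old = store.get(key, absent)
            store[key] = value
            previous[key] = old          # recorded only after a successful write
    except Exception:
        for key, old in previous.items():
            if old is absent:
                store.pop(key, None)
            else:
                store[key] = old
        raise


store = FlakyStore({"cart:1": "old"}, fail_on="stock:9")
try:
    put_all_or_nothing(
        store, {"cart:1": "new", "order:5": "placed", "stock:9": "reserved"}
    )
    rolled_back = False
except IOError:
    rolled_back = True
```

This compensation is not a real transaction: other clients can observe the intermediate state, and the undo writes can themselves fail, which is why the slides advise against key-value stores for this kind of workload.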
