Non-relational (NoSQL) Databases

You have already met the relational database: data split into neat tables of rows and columns, linked by keys, and queried with SQL. It is one of the great inventions of computing, and for good reason — it stores each fact once and keeps data consistent.

But that tidiness comes from a strict rule: before you store anything, you must decide the exact shape of every table — its columns and their types. That fixed plan is called the schema. It is brilliant when your data is regular and well understood. It becomes a straitjacket when your data is huge, changing constantly, or messy and half-structured — think social-media posts, sensor streams, or the billions of events a global app records every day. This is where NoSQL (non-relational) databases come in.

The problem with a fixed schema

Imagine a social network. Every user has a name — easy. But one user adds a nickname, another links three Instagram accounts, a third writes a 500-word bio, and a fourth adds none of these. In a relational table you must invent a column in advance for every possible field, and rows that don't use it sit half-empty:

Users (relational — one rigid table)
UserID	Name	Nickname	Bio	Insta1	Insta2	Insta3
1	Priya	Pri	—	—	—	—
2	Sam	—	—	@sam	@sam_art	@sam_food
3	Ada	—	Loves maths…	—	—	—

The red gaps are wasted, and the design is fragile: the day someone wants a fourth Instagram link, an administrator must alter the whole table. Multiply this by billions of users and dozens of ever-changing features, and the fixed schema starts to hurt. NoSQL asks a different question: what if each record could simply carry whatever fields it needs?

The document idea: store the record whole

The most popular kind of NoSQL database is the document database (for example MongoDB). Instead of scattering one user across several rigid columns, it stores each user as a single self-contained document — a JSON-like object that carries exactly the fields that user has, and no others:

// Two documents in the SAME collection — with DIFFERENT shapes. No schema forbids it. { "userID": 1, "name": "Priya", "nickname": "Pri" } { "userID": 2, "name": "Sam", "instagram": ["@sam", "@sam_art", "@sam_food"], // a list, nested right inside "verified": true }

No empty cells, no wasted columns, and no table to alter when a new field appears — you just add it to the next document. A document can even nest lists and objects inside itself, so related data that a relational design would split across three joined tables can live together in one record. Step through the picture to see the two styles side by side:

That flexibility is the whole trade. A relational database guarantees a rigid, consistent structure; a document database gives that up in exchange for schema flexibility — the freedom to let each record differ and to evolve the data's shape without a painful migration.

The four families of NoSQL

“NoSQL” is really an umbrella over several designs. You should recognise the four main families and roughly what each is good at:

Document

Records are whole JSON-like documents; fields vary from one to the next.

e.g. MongoDB, CouchDB

Good for: user profiles, content, catalogues — data that varies and evolves.

Key–value

The simplest store of all: a giant dictionary of key → value. Ask for a key, get its value back, blisteringly fast.

e.g. Redis, DynamoDB

Good for: caches, shopping baskets, session data, leaderboards.

Column-family

Data grouped into columns rather than rows, so you can read one attribute across billions of records efficiently.

e.g. Cassandra, HBase

Good for: huge write-heavy workloads, time-series and analytics.

Graph

Stores nodes and the relationships between them as first-class things, so “friends of friends” is one cheap hop, not a chain of joins.

e.g. Neo4j

Good for: social networks, recommendations, routing, fraud detection.

A key–value store is really a one-field document store; a document store is a richer key–value store. They share a family resemblance: drop the rigid table of columns, and organise data around how the application actually reads and writes it.

A key–value store like Redis keeps its data in RAM, not on disk, and does essentially one thing: look up a key. There are no joins to compute, no query planner, no rows to scan — just “here is the value for that key.” That makes reads and writes happen in microseconds, which is why real-time games use it for live leaderboards, and busy websites use it as a cache sitting in front of a slower relational database. The price of that speed is that it does very little else: you can't ask it a rich question the way SQL lets you. Different job, different tool.

Scaling out: many cheap servers, not one giant one

The other big reason NoSQL exists is scale. When one database server can no longer cope, you have two choices:

Scale vertically (scale up): buy a bigger, more powerful single machine. There is always a ceiling — and the biggest machines cost a fortune. Relational databases have traditionally leaned this way, because splitting linked tables and joins across machines is hard.
Scale horizontally (scale out): add more ordinary machines and spread the data across them. There is (almost) no ceiling — need more capacity, add another server. NoSQL systems are built for this: because records are self-contained (a whole document, or a key and its value), a record can live on any server without needing to join across the others.

Spreading data across many servers is called sharding, and it is what lets NoSQL handle the “big data” volumes — billions of records, millions of writes a second — that a single relational server could never touch.

Relational databases promise ACID transactions — in short, that a change either fully happens or fully doesn't, and that everyone reading the database sees the same up-to-date answer (strong consistency). That guarantee is easy on one machine but expensive across a hundred. So many NoSQL systems relax it to eventual consistency: after a write, different servers may briefly disagree, but they “catch up” a moment later. For a bank balance that is unacceptable; for a count of likes on a post, nobody minds if it's off by one for half a second. NoSQL trades a little consistency for a lot of scale — on purpose.

At a glance

Relational (SQL) vs non-relational (NoSQL)
Feature	Relational (SQL)	NoSQL (non-relational)
Structure	Tables of rows & columns, linked by keys	Documents, key–value pairs, columns, or graphs
Schema	Fixed — decided in advance	Flexible — often none; records can differ
Query language	SQL (a shared standard)	Varies by product (API/query per database)
Scaling	Usually vertical (a bigger machine)	Horizontal (many machines / sharding)
Consistency	Strong (ACID transactions)	Often relaxed (eventual consistency)
Joins	Built in and powerful	Usually avoided; data stored together instead
Best for	Structured, related data needing consistency	Huge, fast-changing or unstructured data at scale

It is tempting to read “newer, flexible, web-scale” as “better,” but that is the classic mistake. NoSQL is not an upgrade from relational databases — it is a different trade-off. You gain flexibility and horizontal scale; you usually give up the fixed structure, powerful joins, and the cast-iron ACID consistency that make relational databases so trustworthy for money, bookings and records.

Choose based on the data and how you access it, not on fashion: highly structured data with important consistency (a bank, a school's records) ⇒ relational; huge, varied or fast-evolving data where scale matters more than strict consistency (a social feed, a sensor network) ⇒ NoSQL. And in the real world it is rarely either/or — a single large system often uses both: a relational database for orders and payments, Redis as a cache, and a document store for the product catalogue. This is called polyglot persistence: the right store for each job.