What is zookeeper?

21 Jan 2024

ZooKeeper is Hadoops distributed coordination service. It can’t make partial failures go away but it can help you handle partial failures.

Its simple, a stripped down filesystem that exposes a few simple operations
Its expressive, it can be used to build coordination data structures and protocols
Its available, applications can depend on it
It helps loosely coupled interactions, helps computers find other finders
its a library, It lets open source use tried and tested protocols

How can we have an actively maintained list of active servers?

Data Model

Data access is atomic. In reads and writes. No partial failures.
Znodes can be ephemeral or persistent. Ephemeral nodes are deleted when creating client session ends.
You can have sequential znodes, where file paths are generated with numbers. This gives you clear global ordering!
You can create watches. One off znode change notification alerts!

Zookeeper service runs on a cluster. Gets high availability with replication.

All updates to znode tree have a globally unique identifier.

Zookeeper comes with prebuilt distributed data structures and protocols you can use.

oboe