CARVIEW

MOTORHOMES

Select Language

HTTP/2 302 server: nginx date: Tue, 23 Dec 2025 18:34:27 GMT content-type: text/plain; charset=utf-8 content-length: 0 x-archive-redirect-reason: found capture at 20100529010201 location: https://web.archive.org/web/20100529010201/https://github.com/twitter/flockdb/ server-timing: captures_list;dur=0.969124, exclusion.robots;dur=0.072061, exclusion.robots.policy;dur=0.054671, esindex;dur=0.015162, cdx.remote;dur=26.905272, LoadShardBlock;dur=313.741889, PetaboxLoader3.datanode;dur=69.885012, PetaboxLoader3.resolve;dur=102.167382 x-app-server: wwwb-app206-dc6 x-ts: 302 x-tr: 391 server-timing: TR;dur=0,Tw;dur=0,Tc;dur=1 set-cookie: wb-p-SERVER=wwwb-app206; path=/ x-location: All x-as: 14061 x-rl: 0 x-na: 0 x-page-cache: MISS server-timing: MISS x-nid: DigitalOcean referrer-policy: no-referrer-when-downgrade permissions-policy: interest-cohort=() HTTP/2 200 server: nginx date: Tue, 23 Dec 2025 18:34:28 GMT content-type: text/html; charset=utf-8 x-archive-orig-server: nginx/0.7.61 x-archive-orig-date: Sat, 29 May 2010 01:02:01 GMT x-archive-orig-connection: close x-archive-orig-status: 200 OK x-archive-orig-etag: "20027098bbd7a004de5ce7b800dff1c3" x-archive-orig-x-runtime: 57ms x-archive-orig-content-length: 28390 x-archive-orig-cache-control: private, max-age=0, must-revalidate x-archive-guessed-content-type: text/html x-archive-guessed-charset: utf-8 memento-datetime: Sat, 29 May 2010 01:02:01 GMT link: ; rel="original", ; rel="timemap"; type="application/link-format", ; rel="timegate" content-security-policy: default-src 'self' 'unsafe-eval' 'unsafe-inline' data: blob: archive.org web.archive.org web-static.archive.org wayback-api.archive.org athena.archive.org analytics.archive.org pragma.archivelab.org wwwb-events.archive.org x-archive-src: 52_16_20100528224727_crawl100-c/52_16_20100529010026_crawl101.arc.gz server-timing: captures_list;dur=0.617646, exclusion.robots;dur=0.020962, exclusion.robots.policy;dur=0.009681, esindex;dur=0.011274, cdx.remote;dur=16.401711, LoadShardBlock;dur=217.551592, PetaboxLoader3.datanode;dur=122.420653, PetaboxLoader3.resolve;dur=129.063681, load_resource;dur=177.019149 x-app-server: wwwb-app206-dc6 x-ts: 200 x-tr: 468 server-timing: TR;dur=0,Tw;dur=0,Tc;dur=0 x-location: All x-as: 14061 x-rl: 0 x-na: 0 x-page-cache: MISS server-timing: MISS x-nid: DigitalOcean referrer-policy: no-referrer-when-downgrade permissions-policy: interest-cohort=() content-encoding: gzip twitter's flockdb at master - GitHub

twitter / flockdb

A distributed, fault-tolerant graph database. — Read more

This URL has Read+Write access

fix versions to match sbt.

robey (author)

Thu May 27 15:39:46 -0700 2010

commit daa67b5605cc07b34813
tree 3585e7ae23bcabdb15ab
parent a62f7481df4470c4ea1b

flockdb /

name	age	history message
.gitignore	Thu May 13 16:55:44 -0700 2010	ignore me! [robey]
LICENSE	Sun Apr 11 21:28:31 -0700 2010	adding license and some sketch of basic client-... [Nick Kallen]
README.markdown	Tue May 18 13:52:28 -0700 2010	mention that thrift 0.2.0 is required. [robey]
TODO	Sun Apr 11 20:54:00 -0700 2010	initial source import [Nick Kallen]
ant/	Sun Apr 11 20:54:00 -0700 2010	initial source import [Nick Kallen]
build.xml	Tue Apr 27 13:41:44 -0700 2010	unused build target. [robey]
config/	Thu May 20 16:17:01 -0700 2010	don't need so many threads in dev mode. :) [robey]
doc/	Mon May 03 22:36:03 -0700 2010	fix another typo. [robey]
ivy/	Thu May 27 15:39:46 -0700 2010	fix versions to match sbt. [robey]
libs/	Mon Apr 12 12:43:02 -0700 2010	use gizzard from nest instead of committing it ... [robey]
project/	Thu May 27 15:39:35 -0700 2010	don't need to list these explicitly. [robey]
src/	Tue May 25 16:02:17 -0700 2010	tweaked setup script [freels]

README.markdown

FlockDB

FlockDB is a distributed graph database for storing adjancency lists, with goals of supporting:

a high rate of add/update/remove operations
potientially complex set arithmetic queries
paging through query result sets containing millions of entries
ability to "archive" and later restore archived edges
horizontal scaling including replication
online data migration

Non-goals include:

multi-hop queries (or graph-walking queries)
automatic shard migrations

FlockDB is much simpler than other graph databases such as neo4j because it tries to solve fewer problems. It scales horizontally and is designed for on-line, low-latency, high throughput environments such as web-sites.

Twitter uses FlockDB to store social graphs (who follows whom, who blocks whom) and secondary indices. As of April 2010, the Twitter FlockDB cluster stores 13+ billion edges and sustains peak traffic of 20k writes/second and 100k reads/second.

It does what?

If, for example, you're storing a social graph (user A follows user B), and it's not necessarily symmetrical (A can follow B without B following A), then FlockDB can store that relationship as an edge: node A points to node B. It stores this edge with a sort position, and in both directions, so that it can answer the question "Who follows A?" as well as "Whom is A following?"

This is called a directed graph. (Technically, FlockDB stores the adjacency lists of a directed graph.) Each edge has a 64-bit source ID, a 64-bit destination ID, a state (normal, removed, archived), and a 32-bit position used for sorting. The edges are stored in both a forward and backward direction, meaning that an edge can be queried based on either the source or destination ID.

For example, if node 134 points to node 90, and its sort position is 5, then there are two rows written into the backing store:

forward: 134 -> 90 at position 5
backward: 90 <- 134 at position 5

If you're storing a social graph, the graph might be called "following", and you might use the current time as the position, so that a listing of followers is in recency order. In that case, if user 134 is Nick, and user 90 is Robey, then FlockDB can store:

forward: Nick follows Robey at 9:54 today
backward: Robey is followed by Nick at 9:54 today

The (source, destination) must be unique: only one edge can point from node A to node B, but the position and state may be modified at any time. Position is used only for sorting the results of queries, and state is used to mark edges that have been removed or archived (placed into cold sleep).

Building

In theory, building is as simple as

$ ant

but there are some pre-requisites. You need:

java 1.6
ant 1.7
thrift 0.2.0

In addition, the tests require a local mysql instance to be running, and for DB_USERNAME and DB_PASSWORD env vars to contain login info for it. You can skip the tests if you want:

$ ant -Dskip.test=1

There should be support for building with sbt "soon".

Running

Check out the demo for instructions on how to start up a local development instance of FlockDB. It also shows how to add edges, query them, etc.

Community

Twitter: #flockdb
IRC: #twinfra on freenode (irc.freenode.net)
Mailing list: flockdb@googlegroups.com subscribe

Contributors

Nick Kallen @nk
Robey Pointer @robey
John Kalucki @jkalucki
Ed Ceaser @asdf

Original Source | Taken Source