Open Knowledge Foundation Labs

Recline Mozilla CSV Viewer

A FireFox extension which allows you to view, search, graph and map CSV files in the browser (built using Recline). This is an port of the great Rufus Pollock's chrome-csv-viewer on FireFox.

Github

Open Knowledge Labs website
maintained by Andy Lulham and Rufus Pollock

The Open Knowledge Labs website (i.e. the site you’re looking at right now) is itself a collaborative project of Open Knowledge. It is built using Jekyll, a static site generator, and hosted on GitHub Pages. You can contribute by addressing some of the items on the project’s GitHub issue queue (hint: you can start with the easy ones).

Github

Head Start
maintained by Peter Kraker, Philipp Weißensteiner, Fabian Dablander, Christopher Kittel

Head Start is intended for scholars who want to get an overview of a research field. They could be young PhDs getting into a new field, or established scholars who venture into a neighboring field. The idea is that you can see the main areas and papers in a field at a glance without having to do weeks of searching and reading. A prototypical implementation for the field of educational technology can be found on Mendeley Labs. The visualization is also used in Conference Navigator 3 and in the Organic Edunet portal.

Github

Ivo of Chartres
maintained by Bruce Brasington, Martin Brett, Przemyslaw Nowak, Christof Rolker

A collection of the works of Ivo, bishop of Chartres.

This site has four elements

draft texts and some concordances for the three collections traditionally associated with Ivo of Chartres:
- the Collectio Tripartita
- the Decretum and
- the Panormia
a draft list of manuscripts which contain a significant number of Ivo’s letters.

Github

CSV.js
maintained by Rufus Pollock

Simple javascript CSV library focused on the browser with zero dependencies. Supports both parsing and serializing CSV.

Originally developed as part of ReclineJS but now fully standalone.

Github

Data Pipes
maintained by Andy Lulham, Rufus Pollock and David Miller

Data Pipes is a service to provide streaming, "pipe-like" data transformations on the web – things like deleting rows or columns, find and replace, head, grep etc.

Github

reconcile-csv
maintained by Michael Bauer

Reconcile-CSV is an OpenRefine reconciliation service running on top of a CSV file. It uses fuzzy matching to find the most likely candidates for matching. If you ever needed to join two datasets that didn't have unique identifiers and where things are written slightly different: this is a way to go.

Github

Open Correspondence
maintained by Iain Emsley

Open Correspondence is an attempt to explore the letters network of the nineteenth century. At the moment, the project contains some of the letters of Charles Dickens, but we're working to expand to it include many other authors such as Jane Austen, George Eliot and Byron.

Github

ElasticSearch.JS
maintained by Rufus Pollock

A simple javascript library for working with ElasticSearch.

It also provides a backend interface to ElasticSearch suitable for use with the Recline suite of data libraries.

Github

Data Explorer
maintained by Rufus Pollock, Dan Wilson & others

Data Explorer is an in-browser data cleaning and visualization app. Load tabular data, process it with JavaScript, and graph the results, all in the comfort of your browser. Gist-based persistence enables simple versioning and sharing of projects.

Github

Frictionless Data
maintained by Labs

There’s too much friction working with data - friction getting data, friction processing data, friction sharing data.

This friction stops people doing stuff: stops them creating, sharing, collaborating, and using data - especially amongst more distributed communities. It kills the cycles of find, improve, share that would make for a dynamic, productive and attractive (open) data ecosystem.

We need to make an ecosystem that, like open-source for software, is useful and attractive to those without any principled interest, the vast majority who simply want the best tool for the job, the easiest route to their goal.

We think that by getting a few key pieces in place we can reduce friction enough to revolutionize how the (open) data ecosystem operates with massively improved data quality, utilization and sharing.

We’re creating:

Standards: A small set of lightweight ‘data package’ standards and patterns providing a base structure on which tooling and integration can build.
Tooling and Integration: Making it easy to use and publish data packages from your existing apps and workflows whether that’s Excel, R, or Hadoop!
Outreach and Community: Engaging and evangelizing around the concepts, standards and tooling and building a community of users and contributors.

Github

ReclineJS
maintained by Rufus Pollock, Max Ogden & others

Recline is a simple but powerful library for building data applications in pure Javascript and HTML. Building on Backbone, Recline supplies components and structure to data-heavy applications by providing a set of models (Dataset, Record/Row, Field) and views (Grid, Map, Graph etc).

Github

Bubble Tree Library
maintained by Gregor Aisch & David McCandless

The BubbleTree can be used to display hierarchical (spending) data in an interactive visualization. The setup is easy and independent from the OpenSpending platform. However, there is an optional integration module to connect with data from the OpenSpending API.

Github

Nomenklatura
maintained by Friedrich Lindenberg

Nomenklatura de-duplicates and integrates different names for entities - people, organisations or public bodies - to help you clean up messy data and to find links between different datasets. The service will create references for all entities mentioned in a source dataset. It then helps you to define which of these entities are duplicates and what the canonical name for a given entity should be. This information is available in data cleaning tools like OpenRefine or in custom data processing scripts, so that you can automatically apply existing mappings in the future. The focus of nomenklatura is on data integration, it does not provide further functionality with regards to the people and organisations that it helps to keep track of.

Nomenklatura is a simple service that makes it easy to maintain a canonical list of entities such as persons, companies or event streets and to match messy input, such as their names against that canonical list – for example, matching Acme Widgets, Acme Widgets Inc and Acme Widgets Incorporated to the canonical "Acme Widgets".

With Nomenklatura its a matters of minutes to set up your own set of master data to match against and it provides a simple user interface and API which you can then use do matching (the API is compatible with Open Refine's reconciliation function).

Nomenklatura can not only store the master set of entities you want to match against but also will learn and record the various aliases for a given entity - such as a person, organisation or place - may have in various datasets.

Github

CrowdCrafting & PyBossa
maintained by Daniel Lombraña González

CrowdCrafting is a free, open-source crowd-sourcing and micro-tasking platform powered by the PyBossa software. This platform enables people to create and run projects that utilize on-line assistance in performing tasks that require human cognition such as image classification, transcription, geocoding and more. CrowdCrafting is there to help researchers, civic hackers and developers to create projects where anyone around the world with some time, interest and an Internet connection can contribute.

Github

Froide
maintained by Stefan Wehrmeyer

Froide is a Freedom of Information portal written in Python using the Django Web framework. It manages contactable entities, requests and much more. Users can send emails to these entities and receive public answers via the platform.

It was developed to power Frag den Staat – the German Freedom of Information Portal, but is internationalized, localized and themable and has deployed in several different countries.

Frag Den Staat
maintained by Stefan Wehrmeyer

Germany Freedom of Information portal powered by the Froide platform. Responsible for managing over a 1/3 of all Freedom of Information request in Germany.

Github

Annotator
maintained by Nick Stenning & Aron Carroll

The Annotator is an open-source JavaScript library and tool that can be added to any webpage to make it annotatable. Annotations can have comments, tags, users and more. Morever, the Annotator is designed for easy extensibility so its a cinch to add a new feature or behaviour.

Github

Retrato da Violência
maintained by Vitor Baptista, Leo Tartari, Thiago Bueno

Visualization on violence against women in the brazilian state Rio Grande do Sul.

Github

Textus
maintained by Tom Oinn and Rufus Pollock

In a nutshell it is an open source platform for working with collections of texts. It enables students, researchers and teachers to share and collaborate around texts using a simple and intuitive interface.

Github

IATI Tools
maintained by Mark Brough

Library for working with IATI data and converting it into a relational database. http://blog.okfn.org/2012/06/05/from-xml-to-visualisations-iati/

Github

Listify
maintained by Rufus Pollock

Turn a Google spreadsheet into a beautiful, searchable listing in seconds

Github

TimeMapper
maintained by Rufus Pollock

Make timelines & maps from a Google Spreadsheet in seconds

Github

BundesGit
maintained by Stefan Wehrmeyer

Github

Recline Chrome CSV Viewer

A chrome extension which allows you to view, search, graph and map CSV files in the browser (built using Recline)

Github

Kartograph
maintained by Gregor Aisch

Kartograph is a simple and lightweight framework for building interactive map applications without Google Maps or any other mapping service. It was created with the needs of designers and data journalists in mind.

Github

MessyTables
maintained by David Raznick, Friedrich Lindenberg, Dominik Moritz

Tools for parsing messy tabular data. http://okfnlabs.org/blog/2012/10/22/messytables.html

Github

Data Converters
maintained by Nigel Babu, Rufus Pollock

Python library and command line tool for converting data from one format to another. It builds on messytables, GDAL and many more great open-source libraries for processing data, and provides one easy to use standard API.

Github

WikipediaJS
maintained by Rufus Pollock

WikipediaJS is a simple JS library for accessing information in Wikipedia articles such as dates, places, abstracts etc. The library is the work of Labs member Rufus Pollock. In essence, it is a small wrapper around the data and APIs of the DBPedia project and it is they who have done all the heavy lifting of extracting structured data from Wikipedia - huge credit and thanks to DBPedia folks!

Github

For Your Information
maintained by Rowan Crawford

Make Official Information Act requests in New Zealand

Github

OffenesParlement
maintained by Friedrich Lindenberg

OffenesParlement is a site (and open-source codebase) for gathering and presenting information about the work of the Bundestag and Bundesrat.

Github

ActivityAPI
maintained by Tom Rees

A web service for aggregating and querying online activity

Github

[Add a Project]
Labs Projects

Recline Mozilla CSV Viewer

Open Knowledge Labs website
maintained by Andy Lulham and Rufus Pollock

Head Start
maintained by Peter Kraker, Philipp Weißensteiner, Fabian Dablander, Christopher Kittel

Ivo of Chartres
maintained by Bruce Brasington, Martin Brett, Przemyslaw Nowak, Christof Rolker

CSV.js
maintained by Rufus Pollock

Data Pipes
maintained by Andy Lulham, Rufus Pollock and David Miller

reconcile-csv
maintained by Michael Bauer

Open Correspondence
maintained by Iain Emsley

ElasticSearch.JS
maintained by Rufus Pollock

Data Explorer
maintained by Rufus Pollock, Dan Wilson & others

Frictionless Data
maintained by Labs

ReclineJS
maintained by Rufus Pollock, Max Ogden & others

Bubble Tree Library
maintained by Gregor Aisch & David McCandless

Nomenklatura
maintained by Friedrich Lindenberg

CrowdCrafting & PyBossa
maintained by Daniel Lombraña González

Froide
maintained by Stefan Wehrmeyer

Read more

Frag Den Staat
maintained by Stefan Wehrmeyer

Annotator
maintained by Nick Stenning & Aron Carroll

Retrato da Violência
maintained by Vitor Baptista, Leo Tartari, Thiago Bueno

Textus
maintained by Tom Oinn and Rufus Pollock

IATI Tools
maintained by Mark Brough

Listify
maintained by Rufus Pollock

TimeMapper
maintained by Rufus Pollock

BundesGit
maintained by Stefan Wehrmeyer

Recline Chrome CSV Viewer

Kartograph
maintained by Gregor Aisch

MessyTables
maintained by David Raznick, Friedrich Lindenberg, Dominik Moritz

Data Converters
maintained by Nigel Babu, Rufus Pollock

WikipediaJS
maintained by Rufus Pollock

For Your Information
maintained by Rowan Crawford

OffenesParlement
maintained by Friedrich Lindenberg

ActivityAPI
maintained by Tom Rees

[Add a Project] Labs Projects

Recline Mozilla CSV Viewer

Open Knowledge Labs website maintained by Andy Lulham and Rufus Pollock

Head Start maintained by Peter Kraker, Philipp Weißensteiner, Fabian Dablander, Christopher Kittel

Ivo of Chartres maintained by Bruce Brasington, Martin Brett, Przemyslaw Nowak, Christof Rolker

CSV.js maintained by Rufus Pollock

Data Pipes maintained by Andy Lulham, Rufus Pollock and David Miller

reconcile-csv maintained by Michael Bauer

Open Correspondence maintained by Iain Emsley

ElasticSearch.JS maintained by Rufus Pollock

Data Explorer maintained by Rufus Pollock, Dan Wilson & others

Frictionless Data maintained by Labs

ReclineJS maintained by Rufus Pollock, Max Ogden & others

Bubble Tree Library maintained by Gregor Aisch & David McCandless

Nomenklatura maintained by Friedrich Lindenberg

CrowdCrafting & PyBossa maintained by Daniel Lombraña González

Froide maintained by Stefan Wehrmeyer

Read more

Frag Den Staat maintained by Stefan Wehrmeyer

Annotator maintained by Nick Stenning & Aron Carroll

Retrato da Violência maintained by Vitor Baptista, Leo Tartari, Thiago Bueno

Textus maintained by Tom Oinn and Rufus Pollock

IATI Tools maintained by Mark Brough

Listify maintained by Rufus Pollock

TimeMapper maintained by Rufus Pollock

BundesGit maintained by Stefan Wehrmeyer

Recline Chrome CSV Viewer

Kartograph maintained by Gregor Aisch

MessyTables maintained by David Raznick, Friedrich Lindenberg, Dominik Moritz

Data Converters maintained by Nigel Babu, Rufus Pollock

WikipediaJS maintained by Rufus Pollock

For Your Information maintained by Rowan Crawford

OffenesParlement maintained by Friedrich Lindenberg

ActivityAPI maintained by Tom Rees

[Add a Project]
Labs Projects

Open Knowledge Labs website
maintained by Andy Lulham and Rufus Pollock

Head Start
maintained by Peter Kraker, Philipp Weißensteiner, Fabian Dablander, Christopher Kittel

Ivo of Chartres
maintained by Bruce Brasington, Martin Brett, Przemyslaw Nowak, Christof Rolker

CSV.js
maintained by Rufus Pollock

Data Pipes
maintained by Andy Lulham, Rufus Pollock and David Miller

reconcile-csv
maintained by Michael Bauer

Open Correspondence
maintained by Iain Emsley

ElasticSearch.JS
maintained by Rufus Pollock

Data Explorer
maintained by Rufus Pollock, Dan Wilson & others

Frictionless Data
maintained by Labs

ReclineJS
maintained by Rufus Pollock, Max Ogden & others

Bubble Tree Library
maintained by Gregor Aisch & David McCandless

Nomenklatura
maintained by Friedrich Lindenberg

CrowdCrafting & PyBossa
maintained by Daniel Lombraña González

Froide
maintained by Stefan Wehrmeyer

Frag Den Staat
maintained by Stefan Wehrmeyer

Annotator
maintained by Nick Stenning & Aron Carroll

Retrato da Violência
maintained by Vitor Baptista, Leo Tartari, Thiago Bueno

Textus
maintained by Tom Oinn and Rufus Pollock

IATI Tools
maintained by Mark Brough

Listify
maintained by Rufus Pollock

TimeMapper
maintained by Rufus Pollock

BundesGit
maintained by Stefan Wehrmeyer

Kartograph
maintained by Gregor Aisch

MessyTables
maintained by David Raznick, Friedrich Lindenberg, Dominik Moritz

Data Converters
maintained by Nigel Babu, Rufus Pollock

WikipediaJS
maintained by Rufus Pollock

For Your Information
maintained by Rowan Crawford

OffenesParlement
maintained by Friedrich Lindenberg

ActivityAPI
maintained by Tom Rees