Loading Data

GraphDB exposes multiple interfaces for loading RDF data. It can also convert tabular data into RDF and load it directly into an active repository, using simple SPARQL queries and a virtual endpoint. This functionality is based on OpenRefine and supports TSV, CSV, Excel (.xls and .xlsx), JSON, XML, and Google Sheets.
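As a rough illustration of loading RDF over HTTP with a SPARQL query, the sketch below builds (but does not send) a SPARQL 1.1 Update request. Several details are assumptions rather than facts from this page: the base URL `http://localhost:7200`, the hypothetical repository ID `my-repo`, the RDF4J-style `/repositories/<id>/statements` path, and the `ex:` example vocabulary. Adjust them to match your installation.

```python
from urllib.request import Request

# Assumptions (not stated in this document): a GraphDB instance reachable at
# localhost:7200 and a repository with the hypothetical ID "my-repo".
BASE_URL = "http://localhost:7200"
REPOSITORY_ID = "my-repo"


def build_sparql_update_request(
    update: str,
    base_url: str = BASE_URL,
    repository_id: str = REPOSITORY_ID,
) -> Request:
    """Build, without sending, a SPARQL 1.1 Update request for a repository."""
    # Assumed RDF4J-style REST layout: updates are POSTed to the
    # repository's "statements" resource.
    url = f"{base_url}/repositories/{repository_id}/statements"
    return Request(
        url,
        data=update.encode("utf-8"),
        headers={"Content-Type": "application/sparql-update"},
        method="POST",
    )


# A tiny update that inserts one triple, using a made-up example namespace.
INSERT_EXAMPLE = """
PREFIX ex: <http://example.org/>
INSERT DATA { ex:alice ex:knows ex:bob . }
"""

request = build_sparql_update_request(INSERT_EXAMPLE)
```

To actually execute the update, the request object could be passed to `urllib.request.urlopen(request)` once the URL and repository ID point at a running instance.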

GraphDB’s data loading interfaces

| Interface | Use cases | Mode | Speed |
|---|---|---|---|
| SPARQL endpoint | No limits on the file size | Online parallel | Moderate speed |
| Workbench import of a local or remote RDF file | Small files, limited to 200 MB | Online parallel | Moderate speed |
| Workbench import of a server file | No limits on the file size | Online parallel | Fast, ignoring all HTTP protocol overheads |
| LoadRDF | Batch import of very big files | Initial offline import with no plugins | Fast, with a small speed degradation |
| Preload | Import huge datasets with no inference | Initial offline import with no inference or plugins | Ultra fast, without speed degradation |
| OntoRefine | Import and clean non-RDF formats | In-memory operation limited to the available heap | Slow |
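The comparison above can be read as a rough decision rule. The following sketch encodes one possible reading of it. The `LoadJob` fields, the `suggest_interface` helper, the ordering of the checks, and the interpretation of 200 MB as 200 × 1024 × 1024 bytes are all illustrative assumptions, not part of GraphDB.

```python
from dataclasses import dataclass

# "200MB" comes from the comparison above; treating it as binary megabytes
# is an assumption made for this sketch.
WORKBENCH_UPLOAD_LIMIT_BYTES = 200 * 1024 * 1024


@dataclass
class LoadJob:
    """Hypothetical description of a data load, used only for illustration."""
    size_bytes: int
    is_rdf: bool = True                 # False for CSV, Excel, JSON, XML, ...
    initial_offline_load: bool = False  # repository may be offline during the load
    needs_inference: bool = False       # inference must run while loading
    file_on_server: bool = False        # file already sits on the GraphDB server


def suggest_interface(job: LoadJob) -> str:
    """Return the interface name that best matches the job (a simplification)."""
    if not job.is_rdf:
        return "OntoRefine"
    if job.initial_offline_load:
        # Preload is listed as "no inference", so fall back to LoadRDF otherwise.
        return "LoadRDF" if job.needs_inference else "Preload"
    if job.file_on_server:
        return "Workbench import (server file)"
    if job.size_bytes <= WORKBENCH_UPLOAD_LIMIT_BYTES:
        return "Workbench import (local or remote file)"
    return "SPARQL endpoint"


small_upload = suggest_interface(LoadJob(size_bytes=50 * 1024 * 1024))
bulk_no_inference = suggest_interface(
    LoadJob(size_bytes=500 * 1024**3, initial_offline_load=True)
)
spreadsheet = suggest_interface(LoadJob(size_bytes=1024, is_rdf=False))
```

Real deployments would weigh more factors (available heap, enabled plugins, cluster setup), so treat the helper as a reading aid rather than a recommendation engine.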