Loading data

GraphDB exposes multiple interfaces for loading RDF data. It also supports converting tabular data into RDF and loading it directly into an active repository, using simple SPARQL queries and a virtual endpoint. This functionality is based on OpenRefine, and the supported formats are TSV, CSV, Excel (.xls and .xlsx), JSON, XML, and Google Sheets.

GraphDB’s data loading interfaces.
| Interface | Use cases | Mode | Speed |
|---|---|---|---|
| SPARQL endpoint | No limits on the file size | Online parallel | Moderate speed |
| Workbench import of a local or a remote RDF file | Small files, limited to 200 MB | Online parallel | Moderate speed |
| Workbench import of a server file | No limits on the file size | Online parallel | Fast, avoiding all HTTP protocol overhead |
| LoadRDF | Batch import of very big files | Initial offline import with no plugins | Fast, with a small speed degradation |
| Preload | Import of huge datasets with no inference | Initial offline import with no inference or plugins | Ultra fast, without speed degradation |
| OntoRefine | Import and cleaning of non-RDF formats | In-memory operation limited to the available heap | Slow |
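
As an illustration of the online, HTTP-based path listed above, the following is a minimal Python sketch of pushing a small Turtle document into a repository. It assumes that the repository is reachable through the RDF4J-style `/repositories/<id>/statements` REST endpoint and that the server runs at `http://localhost:7200`. The repository ID `my_repo` and the helper names `build_statements_request` and `upload` are hypothetical and used only for this example. Adjust all of these to match your installation.

```python
import urllib.request


def build_statements_request(base_url, repository, turtle_data):
    """Build (but do not send) a POST request that adds Turtle data to a repository.

    Assumption: the server exposes the RDF4J-style endpoint
    <base_url>/repositories/<repository>/statements.
    """
    url = f"{base_url.rstrip('/')}/repositories/{repository}/statements"
    return urllib.request.Request(
        url,
        data=turtle_data.encode("utf-8"),
        headers={"Content-Type": "text/turtle"},
        method="POST",
    )


def upload(request):
    """Send the prepared request and return the HTTP status code."""
    with urllib.request.urlopen(request) as response:
        return response.status


# Hypothetical sample data and repository ID, for illustration only.
SAMPLE_TURTLE = "<urn:example:s> <urn:example:p> <urn:example:o> ."
request = build_statements_request("http://localhost:7200", "my_repo", SAMPLE_TURTLE)
# upload(request)  # uncomment to send against a running server
```

Because every statement travels over HTTP, this approach suits small to moderate uploads. For very large initial loads, the offline tools in the table (LoadRDF and Preload) are the better fit.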