Loading data¶
GraphDB exposes multiple interfaces for loading RDF data. It also supports the conversion of tabular data into RDF and its direct load into an active repository, using simple SPARQL queries and a virtual endpoint. This functionality is based on OpenRefine and the supported formats are TSV, CSV, Excel (.xls and. xlsx), JSON, XML or Google sheet.
Interface | Use cases | Mode | Speed |
---|---|---|---|
SPARQL endpoint | No limits on the file size | Online parallel | Moderate speed |
Workbench import a local or a remote RDF file | Small files limited up to 200MB | Online parallel | Moderate speed |
Workbench import a server file | No limits on the file size | Online parallel | Fast ignoring all HTTP protocol overheads |
LoadRDF | Batch import of very big files | Initial offline import with no plugins | Fast with a small speed degradation |
Preload | Import huge datasets with no inference | Initial offline import with no inference and plugins | Ultra fast without speed degradation |
OntoRefine | Import and clean non RDF based formats | In memory operation limited to the available heap | Slow |