Questions tagged [tdb]

TDB is an open-source RDF database developed and maintained by the Apache Jena project. It is a Java-based embedded database that may be exposed over HTTP using the Fuseki server, also from the Apache Jena project.

Features

Its features include the following:

  • Persisting RDF Triples or Quads
  • Full SPARQL execution
  • Write-ahead logging to provide serializable transactions and fault tolerance

Scalability and Performance

TDB typically scales up to datasets of a few hundred million triples or quads.

Compared to commercial alternatives, TDB is often less scalable, primarily because, while it is persisted to disk, it is fundamentally designed as an in-memory database. It relies heavily on RAM caches and memory-mapped files, so scalability tends to be limited by machine RAM.

TDB also has no clustered mode, so it cannot be scaled horizontally without additional technologies (e.g. manually created replicas and load balancers).

For those interested in comparative performance data, TDB is typically included in the Berlin SPARQL Benchmark results.

137 questions
1
vote
2 answers

Java OutOfMemoryError in apache Jena using TDB

Hi I've been using Jena for a project and now I am trying to query a Graph for storage in plain files for batch processing with Hadoop. I open a TDB Dataset and then I query by pages with LIMIT and OFFSET. I output files with 100000 triplets per…
Nord
  • 253
  • 2
  • 6
1
vote
1 answer

Can I use Apache Jena and persist the ontology using Apache Solr

We have a cloud-based Java application that uses Oracle DB and Apache Solr for document indexing/searching. I need to implement an ontology and I intend to use Apache Jena. It's uncharted territory for me. According to the docs, seems that using…
Jimo
  • 143
  • 2
  • 14
1
vote
1 answer

Does Jena TDB load all data into memory every time?

I am new to Jena. I am trying to work with the Yoga dataset using TDB. The dataset is about 200M and every time I run the same query, it takes about 5 minutes to load the data before giving the results. I am wondering do I misunderstand…
Charlotte
  • 93
  • 11
1
vote
1 answer

command line tdbquery with text index

I am trying to run a text search query with Jena via the command line. tdbquery --desc textsearch.ttl --query search.rq The query returns empty results with the messages: 17:23:46 WARN TextQueryPF :: Failed to find the text index : tried…
1
vote
0 answers

Illegal utf-8 exception while querying apache jena with tdbquery

I use Apache Jena to query RDF data from the Billion Triple Challenge 2014 dataset. I loaded the dataset into Jena with tdbloader. In particular, I use tdbquery to run queries that contain property paths. When I start such a query I often get the…
SNaidra
  • 21
  • 4
1
vote
2 answers

Querying named RDF graphs in TDB using tdbquery

I am trying to query my newly created TDB database using the tdbquery program. However, I am having a hard time writing a query that targets the correct named graph. I am doing the following: First I create a new dataset and add a named graph called…
1
vote
1 answer

Query multiple models within same Dataset in TDB Jena

I have an ontology (say dgo.owl) and another file containing individuals for same ontology stored in rdf format. I have stored both (ontology and instances/individual file) files in TDB under different named models (say dgo_ont and homedata). Now, I…
Haroon Lone
  • 2,837
  • 5
  • 29
  • 65
1
vote
1 answer

Can Jena's TDB be replaced with custom persistence layer?

I'd like to use Jena but not with TDB nor SDB, I already have a custom Triples Database (currently persistence is done with Cassandra). Can I use Jena to load / manipulate RDF models from input and store them on my custom DB? In other words: What…
user40171
  • 155
  • 5
1
vote
1 answer

TDBLoader API generic input file

I'm trying to save a bulk file into my TDB store using TDBLoader from the Jena API, but when the code saves into my TDB it gives me this error: org.apache.jena.riot.RiotException: [line: 1, col: 1 ] Expected BNode or IRI: Got: [DIRECTIVE:prefix] at…
Claudio Pomo
  • 2,392
  • 7
  • 42
  • 71
1
vote
0 answers

java.lang.OutOfMemoryError: GC overhead limit exceeded when I load model RDF in TDB TripleStore

I have a question: I wrote this post, where I said that I had problems when loading a large RDF file into my RDF triple store. How I can load a model in TDB TripleStore This solution worked, but today it causes another problem. My part of…
Musich87
  • 562
  • 1
  • 12
  • 31
1
vote
1 answer

How I can load a model in TDB TripleStore

I have a question for you: I would like to load a file into my Jena TDB triple store. My file is very big, about 80 MB and about 700,000 RDF triples. When I try to load it, the execution stops working or takes a very long time. I'm using this code that I…
Musich87
  • 562
  • 1
  • 12
  • 31
1
vote
0 answers

Jena TDB/Fuseki Performance

I have a simple SPARQL query which executes reasonably fast on my Jena TDB store using a local Fuseki SPARQL endpoint: SELECT DISTINCT ?p WHERE { ?s rdf:type dbpedia-owl:Organisation . ?s ?p dbpedia:California . } LIMIT 10 It takes maybe 10…
Johannes
  • 760
  • 1
  • 7
  • 20
1
vote
1 answer

Jena TDB after reason then update

I am using Jena and I want to update my TDB with the new ontology. For example, I have 100 rows in my ontology; after I add some rules and run the reasoner, there are 105 rows. I need to add these 5 additional rows to my TDB. How can I…
darrelltw
  • 1,734
  • 1
  • 10
  • 21
1
vote
1 answer

Query Jena TDB store

I'm trying to query Jena TDB in Java. My code follows. It seems that my TDB dataset is empty because model.size() (where model is the dataset default model) returns 0. I'm sure it isn't empty, though, because I can query it with Fuseki with my…
Ryfa
  • 13
  • 2
1
vote
1 answer

D2R 0.8.1 integration with Jena TDB

I am using D2R Server for RDB to RDF conversion. Now I want to save this RDF into a Jena TDB-backed dataset while I am using D2R Server. In short, I want to integrate Jena TDB with D2R. Code for RDB to RDF conversion is: public static void main…
Suryansh
  • 11
  • 2