RDF Textual Encoding Framework

RDFTEF is an open source Java framework that supports textual encoding in RDF+OWL. It provides a shell that allows the model to be queried using SPARQL, is able to import existing encoded text in XML TEI format, and can save the model to RDF/XML or export it in form of aspect slices, as XML TEI format.

TEI stands for Text Encoding Initiative , and its Guidelines are an international and interdisciplinary standard that facilitates libraries, museums, publishers, and individual scholars represent a variety of literary and linguistic texts for online research, teaching, and preservation .

However, TEI is XML based, and thus it suffers heavy limitations, such as overlapping annotations. RDF is able to overcome this, using ontology vocabularies and allowing powerful queries.


There are reasons to believe that it would be very interesting to use Semantic Web tools and languages like RDF and OWL for tasks traditionally performed with XML mark-up. Among these, is textual encoding of literary documents and or manuscripts. (see this explorative paper, ELPUB2005). While the idea of using RDF might be very fascinating indeed, there is a huge amount of previous work and legacy standards that would simply make a standardization effort not realistic at the moment. With this project, however, the aim is to allow researcher and interested individuals to experience the the idea, features and possibilities of RDF textual encoding.


This project was developed with the mentoring of SEMEDIA Group and supported by Google's Summer of Code (SoC) program. Check the initial proposal project. Other projects proposed by SEMEDIA to Summer of Code are at it's Project Proposal Page.


RDFTEF is currently alpha software, so portions of the system are incomplete or missing. Feature requests, complaints, bug reports and general comments are welcomed.

RDFTEF is licensed under the GNU General Public License.


RDFTEF currently supports importing and exporting of XML files in a subset of the TEI format. Thus not all tags described at TEI guidelines are managed. A list of the supported tags is provided with the software package (see Download).

The query interface allows queries to be entered in SPARQL Language. Some additional functions are provided so as to allow more interesting and powerful queries. Look at the README file in the software package.


RDFTEF needs the lastest CVS version of Jena2 and ARQ. Working versions of both are provided with RDFTEF, in the lib/ directory. The other packages in lib/ are required by Jena and ARQ.

RDFTEF was developed and tested with Java JDK 1.5. It was not tested with JDK 1.4, though it might work well with it.


Download the last version from the project's CVS.

cvs -d:pserver:anonymous@cvs.sourceforge.net:/cvsroot/rdftef login
cvs -z3 -d:pserver:anonymous@cvs.sourceforge.net:/cvsroot/rdftef co -P RDFTEF2

For more details on how to do this see SourceForge instructions.

Also, take a look at the project's summary page, here at SourceForge.


See the RDFTEF documentation for setup instructions and usage details and examples.


Thank you Giovanni and Christian at SEMEDIA, and Chris, Natalie & co. at Google!

Fabio N. Kepler (fabio DOT kepler AT gmail DOT com)

This project was developed with the mentoring and based on ongoing research of Semedia Group, from the Università Politecnica delle Marche in Ancona, Italy.

Google Summer of Code
This project was supported and funded by Google's Summer of Code Open Source initiative.

RDF Resource Description Framework Ontology Web Language

Hosted at SourceForge

Valid HTML 4.01!

Valid CSS!