Approximative Queries on Semistructured Corpora (Markus' Wiki; page created 2009-02-22 by Markus)
<div>== Background ==<br />
This dissertation project has a long history. It grew out of a position at the University of Regensburg, where I investigated how historians could use digital corpora. I found that the most fundamental precondition was the availability of a search engine, and my Master's thesis set out to write one. My corpus was the PHI / TLG corpus of Greek and Latin texts. Unfortunately, this corpus did not seem to be under active development, although I had heard of plans to convert it to XML. That, however, only became relevant after I had left the university. <br />
<br />
Later I decided that it was worth taking another look at the topic and redesigning my search engine from the ground up so that it could search non-orthographic, XML-encoded historical texts. <br />
<br />
Eventually, after meandering through various sub-issues, I ended up modifying the open-source XML database BerkeleyDB-XML by adding an approximate matching function to its XPath implementation. <br />
<br />
== Publications ==<br />
<br />
See my [[Writing|list of publications]].<br />
<br />
== Links ==<br />
<br />
* [http://www.oracle.com/technology/products/berkeley-db/xml/index.html Oracle Berkeley DB XML]</div>