Faculty of Informatics and Statistics, Department of Information and Knowledge Engineering (DIKE)

Date and time: March 28 2013 (10:30 – 12:00).

Room: 336 RB Non–standard venue!

Presentations

Strigil: A Framework for Data Extraction

Speaker

  • Jakub Stárka, KSI MFF UK

The talk describes Strigil, a system for script-driven data retrieval from textual or weak-structured documents. In particular we identify the most common problems and we discuss some of the possible solutions and workarounds. Additionally the talk involves a description of our proposed scripting language, which allows to define the way how the data should be extracted.

Downloads: slides 1 

Powered by Resource Description Framework (RDF)