Date and time: March 28 2013 (10:30 – 12:00).
Room: 336 RB Non–standard venue!
Strigil: A Framework for Data Extraction
- Jakub Stárka, KSI MFF UK
The talk describes Strigil, a system for script-driven data retrieval from textual or weak-structured documents. In particular we identify the most common problems and we discuss some of the possible solutions and workarounds. Additionally the talk involves a description of our proposed scripting language, which allows to define the way how the data should be extracted.
Downloads: slides 1