Thesis icon

Thesis

AMBER: a domain-aware template based system for data extraction

Abstract:

The web is the greatest information source in human history, yet finding all offers for flats with gardens in London, Paris, and Berlin or all restaurants open after a screening of the latest blockbuster remain hard tasks – as that data is not easily amenable to processing. Extracting web data into databases for easier processing has been a resource-intensive process, requiring human supervision for every source from which to extract. This has been changing with approaches that replace hum...

Expand abstract

Actions


Access Document


Files:

Authors


More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Computer Science
Research group:
DIADEM
Oxford college:
Trinity College
Role:
Author

Contributors

Role:
Supervisor
Publication date:
2015
Type of award:
DPhil
Level of award:
Doctoral
Awarding institution:
Oxford University, UK
Language:
English
Keywords:
Subjects:
UUID:
uuid:ff49d786-bfd8-4cd4-a69c-19e81cb95920
Local pid:
ora:12258
Deposit date:
2015-09-23

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP