Most of the Web of Data is limited to a large compendium of encyclopedic knowledge describing entities. Extracting RDF facts from unstructured data promptly and at scale remains a major challenge. The speaker addresses this problem by presenting an approach for extracting RDF triples from unstructured data streams. The approach combines statistical methods with de-duplication, disambiguation, and both unsupervised and supervised machine learning techniques to create a knowledge base that reflects the content of the input streams.
URL: http://videolectures.net/iswc2013_ngonga_ngomo_data_streams/
Keywords: Streaming data, Unstructured data, Named-entity extraction, Part-of-speech (POS) tagging, Machine learning
Author: Ngonga Ngomo, Axel-Cyrille
Date created: 2013-11-28 05:00:00.000
Language: http://id.loc.gov/vocabulary/iso639-2/eng
Time required: P15M
Educational use: professionalDevelopment
Educational audience: professional
Interactivity type: expositive
- Understands that resources are declared to be members (instances) of classes using the property rdf:type.
- Fundamentals of Resource Description Framework
- RDF data model
- Understands the role of formally declared domains and ranges for inferencing.
- Interacting with RDF data
- Reasoning over RDF data
- Uses available resources for named entity recognition, extraction, and reconciliation.
- Creating and transforming Linked Data
- Mapping and enriching RDF data
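
Two of the competencies listed above, class membership via rdf:type and inferencing from declared domains and ranges, can be illustrated with a minimal sketch. It is not taken from the lecture: the `ex:` resources and the `infer_types` helper are hypothetical, triples are modeled as plain Python tuples rather than with an RDF library, and only the two RDFS rules for rdfs:domain and rdfs:range are applied in a single pass.

```python
# Illustrative sketch only: triples are plain (subject, predicate, object) tuples.
RDF_TYPE = "rdf:type"
RDFS_DOMAIN = "rdfs:domain"
RDFS_RANGE = "rdfs:range"


def infer_types(triples):
    """Return rdf:type triples implied by declared domains and ranges.

    Hypothetical helper: applies only the RDFS domain and range rules,
    once, and does not distinguish literals from resources.
    """
    triples = set(triples)
    domains = [(s, o) for s, p, o in triples if p == RDFS_DOMAIN]
    ranges = [(s, o) for s, p, o in triples if p == RDFS_RANGE]

    inferred = set()
    for s, p, o in triples:
        # Domain rule: the subject of a property is an instance of its domain.
        for prop, cls in domains:
            if p == prop:
                inferred.add((s, RDF_TYPE, cls))
        # Range rule: the object of a property is an instance of its range.
        for prop, cls in ranges:
            if p == prop:
                inferred.add((o, RDF_TYPE, cls))

    # Report only facts that were not already asserted.
    return inferred - triples


graph = {
    # Explicit class membership, declared with rdf:type.
    ("ex:Leipzig", RDF_TYPE, "ex:City"),
    # Schema: ex:worksIn relates a person to a city.
    ("ex:worksIn", RDFS_DOMAIN, "ex:Person"),
    ("ex:worksIn", RDFS_RANGE, "ex:City"),
    # Instance data with no explicit type for ex:alice.
    ("ex:alice", "ex:worksIn", "ex:Leipzig"),
}

new_facts = infer_types(graph)
for fact in sorted(new_facts):
    print(fact)
```

In this sketch the only new fact is that `ex:alice` is an `ex:Person`, derived from the declared domain of `ex:worksIn`. The range rule also yields `ex:Leipzig rdf:type ex:City`, but that triple was already asserted, so it is not reported again.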