Ontology Learning and Population from Text - Algorithms, Evaluation and Applications

Ontology Learning and Population from Text - Algorithms, Evaluation and Applications

von: Philipp Cimiano

Springer-Verlag, 2006

ISBN: 9780387392523

Sprache: Englisch

362 Seiten, Download: 19046 KB

 
Format:  PDF, auch als Online-Lesen

geeignet für: Apple iPad, Android Tablet PC's Online-Lesen PC, MAC, Laptop


 

eBook anfordern

Mehr zum Inhalt

Ontology Learning and Population from Text - Algorithms, Evaluation and Applications



10 Contribution and Outlook (p. 309-310)

This book contributes to the state-of-the-art in ontology learning in several ways. First, we have provided a formal definition of ontology learning tasks with respect to a well-defined ontology model. The ontology learning layer cake, a model for representing the diverse subtasks in ontology learning has been introduced. In addition, evaluation measures for the concept hierarchy induction, relation learning as well as ontology population tasks have been defined. These evaluation measures provide a basis in order to compare different approaches performing a certain task. Most importantly, several original and novel approaches performing a certain task have been presented and compared to other state-of-the-art approaches from the literature using the defined evaluation measures.

Concerning the concept hierarchy induction task, we have presented a novel approach based on Formal Concept Analysis, an original guided agglomerative clustering method as well as a combination approach for the induction of concept hierarchies from text. All the approaches have been evaluated and have been demonstrated to actually outperform current state-of-the-art methods. We have further introduced and discussed several approaches to learning attributes and relations. In particular, we have presented approaches to learn i) attributes, ii) the appropriate domain and range for relations, as well as iii) specific relations using a pattern-based approach. Several approaches to automatically populate an ontology with instances have also been described. We have in particular examined a similarity-based approach as well as introduced the original approach of Learning By Googling. Corresponding evaluations have also been provided. Finally, we have have also discussed applications for ontology learning approaches and demonstrated for two concrete applications that the techniques developed in the context of this book are indeed useful. Throughout the book, we have also provided a thorough overview of related work.

Fortunately, there are a number of open issues which require further research. On the one hand, though we have undertaken a first step towards combining different ontology learning paradigms via a machine-learning approach. further research is needed in this direction to unveil the full potential of such a combination. In particular, other paradigms than our classification-based approach could be explored. One could imagine to train classifiers for each type of basic ontological relation, i.e. isa, part-of, etc. using different methods and then use a calculus as envisioned by Heyer and colleagues [Heyer et al., 2001] as well as Ogata and Collier [Ogata and Collier, 2004] to combine the results of these classifiers and reason on different types of extracted ontological relations. Such a post-extraction reasoning is in fact crucial as the different approaches can produce contradicting information and thus producing a consistent ontology needs some kind of contradiction resolution approach. In fact, one important problem is to generate the optimal ontology maximizing a certain criterion given a certain amount of - possibly contradicting - relations. Initial blueprints for such an approach can be found, for example, in the work of Haase and Volker [Haase and Volker, 2005]. A lot of further research is however needed in this direction.

Another important issue to be clarified is which similarity measures, which weighting measures and which features work best for the task of clustering words. Though we have provided some insights in the present book, much more work is needed to clarify these issues. In the same vein, further experiments are necessary to clarify the relation between syntactic and semantic similarity such as perceived by humans. These issues can only be approached from an experimental perspective. Though there has been a lot of work on this issue, much further research can be expected.

In general, from a theoretical perspective, it would be necessary to clarify what type of ontologies we can actually learn, i.e. domain ontologies, lexical ontologies, upper-level ontologies, application ontologies, etc. Work in this direction has been presented by Bateman [Bateman, 1991], for instance. In this line, it seems also necessary to ask ourselves about the limits of ontology learning techniques. Furthermore, an integration of ontology learning techniques with linguistic theories, in particular with lexicon theories such as Generative Lexicon [Pustejovsky, 1991] is definitely desirable. In addition, it seems desirable to clarify the relation between ontological and lexical semantics. In the long term, it would definitely be interesting to acquire more complex relationships between concepts and relations in the form of rules or axioms. Last but not least, approaches should actually have reasonable applications. We have argued that it is far from straightforward to devise applications making use of automatically learned ontologies in a reasonable way. The problem lies in the fact that there are a number of parameters to be tuned on which the success of using an ontology depends. However, the quest for applications is a necessary and crucial one. Future research should thus further examine the usefulness of automatically derived knowledge structures for certain applications.

Kategorien

Service

Info/Kontakt