%0 Conference Proceedings %T MetaExtractor: A System for Metadata Extraction from Structured Data Sources %+ Pontificia Universidad Javeriana (PUJ) %A Pomares-Quimbaya, Alexandra %A Torres-Moreno, Miguel, Eduardo %A Roldán, Fabián %Z Part 1: Cross-Domain Conference and Workshop on Multidisciplinary Research and Practice for Information Systems (CD-ARES 2013) %< avec comité de lecture %( Lecture Notes in Computer Science %B 1st Cross-Domain Conference and Workshop on Availability, Reliability, and Security in Information Systems (CD-ARES) %C Regensburg, Germany %Y Alfredo Cuzzocrea %Y Christian Kittl %Y Dimitris E. Simos %Y Edgar Weippl %Y Lida Xu %I Springer %3 Availability, Reliability, and Security in Information Systems and HCI %V LNCS-8127 %P 84-99 %8 2013-09-02 %D 2013 %K Large Scale Data Mediation %K Metadata Extraction %K Source Selection %K Multi-Agent System %K Mediation Systems %Z Computer Science [cs] %Z Humanities and Social Sciences/Library and information sciencesConference papers %X The extraction of metadata used during the planning phase in mediation systems assumes the existence of a metadata repository that in most cases must be created with high human involvement. This dependency rises complexity of maintenance of the system and therefore the reliability of the metadata itself. This article presents MetaExtractor, a system which extracts structure, quality, capability and content metadata of structured data sources available on a mediation system. MetaExtractor is designed as a Multi-Agent System(MAS) where each agent specializes in the extraction of a particular type of metadata. The MAS cooperation capability allows the creation and maintenance of the metadata repository. MetaExtractor is useful to reduce the number of data sources selected during query planning in large scale mediation systems due to its ability to prioritize data sources that better contribute to answer a query. The work reported in this paper presents the general architecture of MetaExtractor and emphasizes on the extraction logic of content metadata and the strategy used to prioritize data sources accordingly to a given query. %G English %2 https://inria.hal.science/hal-01506797/document %2 https://inria.hal.science/hal-01506797/file/978-3-642-40511-2_7_Chapter.pdf %L hal-01506797 %U https://inria.hal.science/hal-01506797 %~ SHS %~ IFIP-LNCS %~ IFIP %~ IFIP-TC %~ IFIP-TC5 %~ IFIP-WG %~ IFIP-TC8 %~ IFIP-CD-ARES %~ IFIP-WG8-4 %~ IFIP-WG8-9 %~ IFIP-LNCS-8127