The University of Dublin | Trinity College -- Ollscoil Átha Cliath | Coláiste na Tríonóide
Trinity's Access to Research Archive
Home :: Log In :: Submit :: Alerts ::

School of Computer Science and Statistics >
Computer Science >
Computer Science (Scholarly Publications) >

Please use this identifier to cite or link to this item:

Title: Automatic Metadata Extraction from Multilingual Enterprise Content
Other Titles: ACM International Conference on Information and Knowledge Management (CIKM 2010)
Sponsor: Science Foundation Ireland
Author's Homepage:
Keywords: Computer Science
Metadata generation
Issue Date: 2010
Citation: Melike Sah and Vincent Wade, Automatic Metadata Extraction from Multilingual Enterprise Content, ACM International Conference on Information and Knowledge Management (CIKM 2010), Toronto, Canada, 2010, 1665 - 1668
Abstract: Enterprises provide professionally authored content about their products/services in different languages for use in web sites and customer care. For customer care, personalization/personalized information delivery is becoming important since it re-encourages users to return to the service provider. Personalization usually requires both contextual and descriptive metadata. But current metadata authored by content developers is usually quite simple. In this paper, we introduce an automatic metadata extraction framework, which can extract multilingual metadata from the enterprise content, for a personalized information retrieval system. We introduce two new ontologies for metadata creation and a novel semi-automatic topic vocabulary extraction algorithm. We demonstrate and evaluate our approach on the English and German Symantec Norton 360 technical content. Evaluations indicate that the proposed approach produces rich and high quality metadata for a personalized information retrieval system.
Description: PUBLISHED
Toronto, Canada
Related links:
Appears in Collections:Computer Science (Scholarly Publications)

Files in This Item:

File Description SizeFormat
p1665-sah.pdfPublished (publisher's copy) - Peer Reviewed497.2 kBAdobe PDFView/Open

This item is protected by original copyright

Please note: There is a known bug in some browsers that causes an error when a user tries to view large pdf file within the browser window. If you receive the message "The file is damaged and could not be repaired", please try one of the solutions linked below based on the browser you are using.

Items in TARA are protected by copyright, with all rights reserved, unless otherwise indicated.


Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback