Automatic Metadata Extraction from Multilingual Enterprise Content

File Type:

PDF

Item Type:

Conference Paper

Date:

2010

Author:

WADE, VINCENT PATRICK

SAH, MELIKE

Citation:

Melike Sah and Vincent Wade, Automatic Metadata Extraction from Multilingual Enterprise Content, ACM International Conference on Information and Knowledge Management (CIKM 2010), Toronto, Canada, 2010, 1665 - 1668

Download Item:

p1665-sah.pdf (Published (publisher's copy) - Peer Reviewed) 497.2Kb

Abstract:

Enterprises provide professionally authored content about their products and services in different languages for use on web sites and in customer care. For customer care, personalized information delivery is becoming important because it encourages users to return to the service provider. Personalization usually requires both contextual and descriptive metadata, but the metadata currently authored by content developers is usually quite simple. In this paper, we introduce an automatic metadata extraction framework that can extract multilingual metadata from enterprise content for a personalized information retrieval system. We introduce two new ontologies for metadata creation and a novel semi-automatic topic vocabulary extraction algorithm. We demonstrate and evaluate our approach on the English and German Symantec Norton 360 technical content. Evaluations indicate that the proposed approach produces rich, high-quality metadata for a personalized information retrieval system.

URI:

http://hdl.handle.net/2262/49078

Sponsor:

Science Foundation Ireland

Author's Homepage:

http://people.tcd.ie/sahm
http://people.tcd.ie/vwade

Description:

PUBLISHED
Toronto, Canada

Other Titles:

ACM International Conference on Information and Knowledge Management (CIKM 2010)

Availability:

Full text available

Keywords:

Computer Science, Metadata generation
