Show simple item record

dc.contributor.authorSmolic, Aljosa
dc.contributor.authorGhosal, Koustav
dc.contributor.authorRana, Aakanksha
dc.date.accessioned2020-02-18T17:17:33Z
dc.date.available2020-02-18T17:17:33Z
dc.date.issued2019
dc.date.submitted2019en
dc.identifier.citationK. Ghosal, A. Rana and A. Smolic, "Aesthetic Image Captioning From Weakly-Labelled Photographs," 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW), Seoul, Korea (South), 2019, pp. 4550-4560en
dc.identifier.otherY
dc.identifier.urihttps://v-sense.scss.tcd.ie/wp-content/uploads/2019/08/ICCVW_CROMOL_2019.pdf
dc.identifier.urihttp://hdl.handle.net/2262/91579
dc.descriptionPUBLISHEDen
dc.description.abstractAesthetic image captioning (AIC) refers to the multimodal task of generating critical textual feedbacks for photographs. While in natural image captioning (NIC), deep models are trained in an end-to-end manner using large curated datasets such as MS-COCO, no such large-scale, clean dataset exists for AIC. Towards this goal, we propose an automatic cleaning strategy to create a benchmarking AIC dataset, by exploiting the images and noisy comments easily available from photography websites. We propose a probabilistic caption-filtering method for cleaning the noisy web-data, and compile a large-scale, clean dataset ‘AVACaptions’, ( ∼ 230, 000 images with ∼ 5 captions per image). Additionally, by exploiting the latent associations between aesthetic attributes, we propose a strategy for training a convolutional neural network (CNN) based visual feature extractor, typically the first component of an AIC framework. The strategy is weakly supervised and can be effectively used to learn rich aesthetic representations, without requiring expensive ground-truth annotations. We finally showcase a thorough analysis of the proposed contributions using automatic metrics and subjective evaluations.en
dc.language.isoenen
dc.rightsYen
dc.subjectAesthetic image captioningen
dc.subjectNatural image captioningen
dc.subjectConvolutional neural networksen
dc.titleAesthetic Image Captioning from Weakly-Labelled Photographsen
dc.typeConference Paperen
dc.contributor.sponsorSFI stipenden
dc.type.supercollectionscholarly_publicationsen
dc.type.supercollectionrefereed_publicationsen
dc.identifier.peoplefinderurlhttp://people.tcd.ie/smolica
dc.identifier.rssinternalid212561
dc.identifier.doi10.1109/ICCVW.2019.00556en
dc.rights.ecaccessrightsopenAccess
dc.contributor.sponsorGrantNumber15/RP/2776en
dc.subject.TCDThemeCreative Technologiesen
dc.subject.TCDTagMultimedia & Creativityen
dc.subject.darat_impairmentOtheren
dc.status.accessibleNen


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record