Show simple item record

dc.contributor.advisor: Smolic, Aljosa [en]
dc.contributor.author: Ghosal, Koustav [en]
dc.date.accessioned: 2021-08-19T13:56:07Z
dc.date.available: 2021-08-19T13:56:07Z
dc.date.issued: 2021 [en]
dc.date.submitted: 2021 [en]
dc.identifier.citation: Ghosal, Koustav, Applications in Image Aesthetics Using Deep Learning: Attribute Prediction, Image Captioning and Score Regression, Trinity College Dublin. School of Computer Science & Statistics, 2021 [en]
dc.identifier.other: Y [en]
dc.identifier.uri: http://hdl.handle.net/2262/96848
dc.description: APPROVED [en]
dc.description.abstract: Image Aesthetics is the branch of computer vision that studies the aesthetic properties of photographs, i.e. the factors that make an image look pleasing or dull. Such factors extend beyond the physical properties of an image, such as object category or location, to subtler, more ambiguous concepts such as "candid expression", "harsh lighting" and "bad placement". Nevertheless, problems in Image Aesthetics have traditionally been modelled as classical computer vision tasks such as classification and regression. As with most other problems in computer vision, deep-learning-based strategies have proved more effective in this area as well, outperforming classical approaches by a wide margin. Automated systems for Image Aesthetics Analysis now have widespread applications, from professional multimedia content development to casual creatives in social media and advertising. In this thesis, we study three applications in Image Aesthetics using deep learning: attribute classification, captioning and score prediction. First, we study the capacity of deep neural networks to capture geometric attributes, i.e. those that depend on the arrangement of objects within the image. Based on this, we propose a system that predicts the dominant aesthetic attributes in a photograph, such as the Rule of Thirds and leading lines. Second, we develop an aesthetic image captioning framework by exploiting "in the wild" user feedback from the web. Given an image, our framework generates critical feedback such as "nice composition but the foreground is out of focus". Third, we investigate the limitations of traditional convolutional neural networks with respect to global relational reasoning and handling photographs of arbitrary aspect ratio and resolution. We present a visual-attention-based graph neural network that addresses these limitations and advances the state of the art in aesthetic score prediction. [en]
dc.publisher: Trinity College Dublin. School of Computer Science & Statistics. Discipline of Computer Science [en]
dc.rights: Y [en]
dc.subject: Image Aesthetics Assessment [en]
dc.subject: Deep Learning [en]
dc.subject: Image Captioning [en]
dc.subject: Machine Learning [en]
dc.subject: Aesthetic Visual Analysis [en]
dc.title: Applications in Image Aesthetics Using Deep Learning: Attribute Prediction, Image Captioning and Score Regression [en]
dc.type: Thesis [en]
dc.type.supercollection: thesis_dissertations [en]
dc.type.supercollection: refereed_publications [en]
dc.type.qualificationlevel: Doctoral [en]
dc.identifier.peoplefinderurl: https://tcdlocalportal.tcd.ie/pls/EnterApex/f?p=800:71:0::::P71_USERNAME:GHOSALK [en]
dc.identifier.rssinternalid: 232663 [en]
dc.rights.ecaccessrights: openAccess
dc.contributor.sponsor: Science Foundation Ireland (SFI for RF) [en]

