VSG-GAN: A high-fidelity image synthesis method with semantic manipulation in retinal fundus image
File Type:
PDFItem Type:
Journal ArticleDate:
2024Access:
openAccessCitation:
Junjie Liu, Shixin Xu, Ping He, Sirong Wu, Xi Luo, Yuhui Deng, Huaxiong Huang, VSG-GAN: A high-fidelity image synthesis method with semantic manipulation in retinal fundus image, Biophysical Journal, 2024Download Item:
Abstract:
In recent years, advancements in retinal image analysis, driven by machine learning and deep learning
techniques, have enhanced disease detection and diagnosis through automated feature extraction. However, challenges persist, including limited dataset diversity due to privacy concerns and imbalanced sample
pairs, hindering effective model training. To address these issues, we introduce the Vessel & Style Guided
Generative Adversarial Network (VSG-GAN), an innovative algorithm building upon the foundational concept of GAN. In VSG-GAN, a generator and discriminator engage in an adversarial process to produce
realistic retinal images. Our approach decouples retinal image generation into distinct modules: the vascular skeleton and background style. Leveraging style transformation and GAN inversion, our proposed
Hierarchical Variational Autoencoder (HVAE) module generates retinal images with diverse morphological traits. Additionally, the Spatially-Adaptive De-normalization (SPADE) module ensures consistency
between input and generated images. We evaluate our model on MESSIDOR and RITE datasets using
various metrics, including Structural Similarity Index Measure (SSIM), Inception Score (IS), Fr´echet Inception Distance (FID), and Kernel Inception Distance (KID). Our results demonstrate the superiority
of VSG-GAN, outperforming existing methods across all evaluation assessments. This underscores its effectiveness in addressing dataset limitations and imbalances. Our algorithm provides a novel solution to
challenges in retinal image analysis by offering diverse and realistic retinal image generation. Implementing
the VSG-GAN augmentation approach on downstream diabetic retinopathy classification tasks has shown
enhanced disease diagnosis accuracy, further advancing the utility of machine learning in this domain.
Sponsor
Grant Number
European Research Council (ERC)
101002240
National Natural Science Foundation of China
12231004 & 12071190
Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, BNU-HKBU United International College
2022B1212010006
Guangdong University Innovation and Enhancement Programme Funds Featured Innovation Projec
2018KTSCX278
UIC Research Grants
R5201910, R201809, UICR0600036, and UICR0600048
Author's Homepage:
http://people.tcd.ie/liuj13Description:
PUBLISHEDType of material:
Journal ArticleCollections
Series/Report no:
Biophysical Journal;Availability:
Full text availableDOI:
https://doi.org/10.1016/j.bpj.2024.02.019Metadata
Show full item recordLicences: