Evaluating the Performance of StyleGAN2-ADA on Medical Images

Woodland, McKell; Wood, John; Anderson, Brian; Kundu, Suprateek; Lin, E.; Koay, Eugene J.; Odisio, Bruno C.; Chung, Caroline; Kang, Hyunseon C.; Venkatesan, Aradhana M.; Yedururi, Sireesha; De, Brian; Lin, Yuan-Mao; Patel, Ankit; Brock, Kristy K.

doi:10.1007/978-3-031-16980-9_14

Cited by 14 publications

(9 citation statements)

References 25 publications

(25 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Thus, some studies argue that using FID for medical imaging is neither practical nor feasible and suggest replacing the inception network with their own encoding networks 46,47 . Nonetheless, recent studies using StyleGAN2 have reported their results using FID 21,45 , which is different from the approach of using their own encoding networks for FID evaluation in medical imaging. This is because the alternative approach lacks consistency in evaluating and comparing FID because it does not use the same encoding model as ImageNet 21,48 .…”

Section: Discussionmentioning

confidence: 93%

“…These results may appear unsatisfactory when compared with other medical studies. One study 21 reported FID scores of 5.22 (± 0.17) for a liver CT dataset on a StyleGAN2 network with transfer learning from the FFHQ dataset, and FIDs of 10.78, 3.52, 21.17, and 5.39 on the publicly available SLIVER07, ChestX-ray14, ACDC, and Medical Segmentation Decathlon (brain tumors) datasets. In another study 45 , the FID was approximately 20 for synthesized magnetic resonance and CT images.…”

Section: Discussionmentioning

confidence: 99%

See 1 more Smart Citation

Evaluating the performance of generative adversarial network-synthesized periapical images in classifying C-shaped root canals

Yang,

Kim,

Ariji

et al. 2023

Sci Rep

View full text Add to dashboard Cite

This study evaluated the performance of generative adversarial network (GAN)-synthesized periapical images for classifying C-shaped root canals, which are challenging to diagnose because of their complex morphology. GANs have emerged as a promising technique for generating realistic images, offering a potential solution for data augmentation in scenarios with limited training datasets. Periapical images were synthesized using the StyleGAN2-ADA framework, and their quality was evaluated based on the average Frechet inception distance (FID) and the visual Turing test. The average FID was found to be 35.353 (± 4.386) for synthesized C-shaped canal images and 25.471 (± 2.779) for non C-shaped canal images. The visual Turing test conducted by two radiologists on 100 randomly selected images revealed that distinguishing between real and synthetic images was difficult. These results indicate that GAN-synthesized images exhibit satisfactory visual quality. The classification performance of the neural network, when augmented with GAN data, showed improvements compared with using real data alone, and could be advantageous in addressing data conditions with class imbalance. GAN-generated images have proven to be an effective data augmentation method, addressing the limitations of limited training data and computational resources in diagnosing dental anomalies.

show abstract

Section: Discussionmentioning

confidence: 93%

Section: Discussionmentioning

confidence: 99%

Evaluating the performance of generative adversarial network-synthesized periapical images in classifying C-shaped root canals

Yang,

Kim,

Ariji

et al. 2023

Sci Rep

View full text Add to dashboard Cite

show abstract

“…The FID has been reported to be over-reliant on texture and is argued not to be directly transferable to medical images (Hong et al, 2021). Other works argue that FID aligns well with visual quality analyses (Woodland et al, 2022). In addition to FID, we therefore use the MedicalNet (Chen et al, 2019) for feature extraction before calculating the Fréchet Distance.…”

Section: Discussionmentioning

confidence: 99%

Generative Modeling of the Circle of Willis Using 3D-StyleGAN

Aydin,

Hilbert,

Koch

et al. 2024

Preprint

View full text Add to dashboard Cite

The circle of Willis (CoW) is a network of cerebral arteries with significant inter-individual anatomical variations. Deep learning has been used to characterize and quantify the status of the CoW in various applications for the diagnosis and treatment of cerebrovascular disease. In medical imaging, the performance of deep learning models is limited by the diversity and size of training datasets. To address medical data scarcity, generative adversarial networks (GANs) have been applied to generate synthetic vessel neuroimaging data. However, the proposed methods produce synthetic data with limited anatomical fidelity or downstream utility in tasks concerning vessel characteristics.We adapted the StyleGANv2 architecture to 3D to synthesize Time-of-Flight Magnetic Resonance Angiography (TOF MRA) volumes of the CoW. For generative modeling, we used 1782 individual TOF MRA scans from 6 open source datasets. To train the adapted 3D StyleGAN model with limited data we employed differentiable data augmentations and used mixed precision and a cropped region of interest of size 32×128×128 to tackle computational constraints. The performance was evaluated quantitatively using the Fréchet Inception Distance (FID), MedicalNet distance (MD) and Area Under the Curve of the Precision and Recall Curve for Distributions (AUC-PRD). Qualitative analysis was performed via a visual Turing test. We demonstrated the utility of generated data in a downstream task of multiclass semantic segmentation of CoW arteries. Vessel segmentation performance was assessed quantitatively using the Dice coefficient and the Hausdorff distance.The best-performing 3D StyleGANv2 architecture generated high-quality and diverse synthetic TOF MRA volumes (FID: 12.17, MD: 0.00078, AUC-PRD: 0.9610). Multiclass vessel segmentation models trained on synthetic data alone achieved comparable performance to models trained using real data in most arteries.In conclusion, generative modeling of the Circle of Willis via synthesis of 3D TOF MRA data paves the way for generalizable deep learning applications in cerebrovascular disease. In the future, the extensions of the provided methodology to other medical imaging problems or modalities with the inclusion of pathological datasets has the potential to advance the development of more robust models for clinical applications.

show abstract

“…In addition to the above automated methods, evaluation approaches, which involve humans/experts, have also been used, for example, the visual turing test [ 41 ], five-point Likert scale [ 38 ], and human eye perceptual evaluation (HYPE) [ 39 ]. Although these methods are considered the most accurate methods and are the gold standard, they are costly and time-consuming.…”

Section: Related Workmentioning

confidence: 99%

Evaluating Synthetic Medical Images Using Artificial Intelligence with the GAN Algorithm

Abdusalomov

Nasimov

Nasimova

et al. 2023

Sensors

View full text Add to dashboard Cite

In recent years, considerable work has been conducted on the development of synthetic medical images, but there are no satisfactory methods for evaluating their medical suitability. Existing methods mainly evaluate the quality of noise in the images, and the similarity of the images to the real images used to generate them. For this purpose, they use feature maps of images extracted in different ways or distribution of images set. Then, the proximity of synthetic images to the real set is evaluated using different distance metrics. However, it is not possible to determine whether only one synthetic image was generated repeatedly, or whether the synthetic set exactly repeats the training set. In addition, most evolution metrics take a lot of time to calculate. Taking these issues into account, we have proposed a method that can quantitatively and qualitatively evaluate synthetic images. This method is a combination of two methods, namely, FMD and CNN-based evaluation methods. The estimation methods were compared with the FID method, and it was found that the FMD method has a great advantage in terms of speed, while the CNN method has the ability to estimate more accurately. To evaluate the reliability of the methods, a dataset of different real images was checked.

show abstract

Evaluating the Performance of StyleGAN2-ADA on Medical Images

Cited by 14 publications

References 25 publications

Evaluating the performance of generative adversarial network-synthesized periapical images in classifying C-shaped root canals

Evaluating the performance of generative adversarial network-synthesized periapical images in classifying C-shaped root canals

Generative Modeling of the Circle of Willis Using 3D-StyleGAN

Evaluating Synthetic Medical Images Using Artificial Intelligence with the GAN Algorithm

Contact Info

Product

Resources

About