Data-driven Evaluation of Deep Generative Models in Biomedical Imaging