Mike Walmsley
Gather.town id
MLA15
Poster Title
Practical Morphology Tools from Deep Supervised Representation Learning
Institution
University of Manchester
Abstract (short summary)
Deep learning relies on finding meaningful representations of data. These representations are particularly important for galaxy morphology tasks, where complex images are difficult to interpret directly. We argue that the recently-created Galaxy Zoo DECaLS model, trained to accurately answer every Galaxy Zoo question simultaneously, has learned a meaningful representation of morphology that is useful for new tasks. We exploit this to provide several open-source tools for investigating large galaxy samples. These are aimed at researchers hoping to exploit deep learning approaches for their own challenges but without the capacity for citizen-science-scale labeling.
These tools are; a similarity search web interface, to identify galaxies of similar morphology to a query galaxy; an active learning anomaly detection algorithm (extending astronomaly), to identify the most interesting anomalies to a particular researcher; and a morphology transfer learning Python package, to build classifiers from only a few hundred labelled examples.
We develop and demonstrate the performance of our tools using 911,442 galaxies imaged by DECaLS. This includes producing the first large-scale catalogue of ring galaxies, identified using transfer learning and 212 examples tagged by volunteers on the Galaxy Zoo forum.
These tools are; a similarity search web interface, to identify galaxies of similar morphology to a query galaxy; an active learning anomaly detection algorithm (extending astronomaly), to identify the most interesting anomalies to a particular researcher; and a morphology transfer learning Python package, to build classifiers from only a few hundred labelled examples.
We develop and demonstrate the performance of our tools using 911,442 galaxies imaged by DECaLS. This includes producing the first large-scale catalogue of ring galaxies, identified using transfer learning and 212 examples tagged by volunteers on the Galaxy Zoo forum.
Plain text (extended) Summary
Please view this interactive poster at bit.ly/decals_viz. If you are vision-impaired and cannot view the visualization, here is a summary.
We use a Bayesian convolutional neural network to predict every Galaxy Zoo question with 99% accuracy (where the volunteers are confident). The website uses sliders to allow the user to filter the GZ DECaLS galaxies according to the model predictions, and (hopefully) this filtering is visually effective.
Interested in using our pretrained model as a starting point for your own classifier? We made a package!
docs: zoobot.readthedocs.io/
code: github.com/mwalmsley/zoobot
data: zenodo.org/record/4573248
arxiv: 2102.08414
I hope these links are more accessible to you than a pdf.
URL
michael.walmsley@manchester.ac.uk
Poster file