Anomaly Detection With Machine Learning In Astronomical Images
Abstract
Observations that push the boundaries have historically fuelled scientific breakthroughs, and these observations frequently involve phenomena that were previously unseen and unidentified. Data sets have increased in size and quality as modern technology advances
at a record pace. Finding these elusive phenomena within these large data sets becomes a tougher challenge with each advancement made. Fortunately, machine learning techniques have proven to be extremely valuable in detecting outliers within data sets. Astronomaly is a framework that utilises machine learning techniques for anomaly detection in astronomy and incorporates active learning to provide target specific results. It is used here to evaluate whether machine learning techniques are suitable to detect anomalies within the optical astronomical data obtained from the Dark Energy Camera Legacy Survey. Using the machine learning algorithm isolation forest, Astronomaly
is applied on subsets of the Dark Energy Camera Legacy Survey (DECaLS) data set. The pre-processing stage of Astronomaly had to be significantly extended to handle real survey data from DECaLS, with the changes made resulting in up to 10% more sources having their features extracted successfully. For the top 500 sources returned, 292 were ordinary sources, 86 artefacts and masked sources and 122 were interesting anomalous sources. A supplementary machine learning algorithm known as active learning enhances the identification probability of outliers in data sets by making it easier to identify target specific sources. The addition of active learning further increases the amount of
interesting sources returned by almost 40%, with 273 ordinary sources, 56 artefacts and 171 interesting anomalous sources returned. Among the anomalies discovered are some merger events that have been successfully identified in known catalogues and several candidate merger events that have not yet been identified in the literature. The results indicate that machine learning, in combination with active learning, can be effective in detecting anomalies in actual data sets. The extensions integrated into Astronomaly pave the way for its application on future surveys like the Vera C. Rubin Observatory Legacy Survey of Space and Time.
Collections
Related items
Showing items related by title, author, creator and subject.
-
Assessing the quality of demographic data on age and sex collected from census 2001, General Household surveys (2004-2007), Labour Force surveys (2005-2007) and Community survey 2007 in South Africa
Kamleu, Germaine (University of the Western Cape, 2012)In many countries, an enumeration of all household members remains the most important source of population statistics. According to Statistics South Africa, two population censuses and quite a few household surveys have ... -
Factors affecting the enrolment rate of students in higher education institutions in the Gauteng province, South Africa : based on General Household Survey 2012
Matsolo, Tlou Mpho Joyce (University of the Western Cape, 2015)Background: In South Africa, many students are not able to register at higher education institutions after receiving their high school diploma. The majority of those who do register do not even complete their tertiary ... -
Health inequalities of children in sub-Saharan Africa from 1990 to 2010 : comparative analysis using data from Health and Demographic Surveys
Bado, Aristide Romaric (University of the Western Cape, 2016)This study is based on the assumption that the under-five mortality rate, in recent decades, has declined, particularly in developing countries. However, all the social strata across many countries do not seem to benefit ...