SkillAgentSearch skills...

ExoMiner

Automating the vetting and validation of planet candidates from photometry survey missions - Kepler and TESS - using deep learning methods

Install / Use

/learn @nasa/ExoMiner

README

ExoMiner

<div style="text-align: center;"> <img src="/others/images/exominer_logo.png" width="250" height="250" alt="Exominer Logo"> </div>

Written by Miguel Martinho (miguel.martinho@nasa.gov)

Introduction

This project's mission is to develop, test, and deploy automated machine learning-based methods to sift ('mine') through transit photometry data from exoplanet survey missions such as Kepler and TESS and inform subject matter experts (SMEs) on potential transiting planet candidates.

Current main goals

The main goals of the ExoMiner project are:

  1. Perform classification of transit signals in Kepler and TESS data;
  2. Create vetted catalogs of Threshold Crossing Events (TCEs) for Kepler and TESS runs for the exoplanet community.
  3. Validate new exoplanets using Kepler and TESS data.

ExoMiner Pipeline [NEW]

Pipeline Demo - Show pipeline run using the terminal

The ExoMiner Pipeline is a fully integrated pipeline from TIC IDs to ExoMiner prediction scores for the corresponding TESS SPOC TCEs in 2-min/FFI data. This pipeline makes use of Podman as a container framework to provide a seamless experience to the user - no need for setting up the code repository and install package dependencies! Simply get the Podman image for your system's architecture using the manifest. See the documentation in here to get started.

For those interested in running the pipeline without resorting to Podman, see Running the pipeline without Podman.

Repository Overview

The repository consists of the following main blocks:

  1. Data preprocessing: preprocess data products (e.g. light curve FITS files) to generate a catalog of transit signal features to be used for training and evaluating models, and to run inference on it. Code under src_preprocessing.
  2. Model training/evaluation/prediction: train and evaluate models on the preprocessed data. Run inference on the data using trained models. Code under src.
  3. Model evaluation using cross-validation under src_cv.
  4. Hyperparameter Optimization: use code under src_hpo to run hyperparameter optimization experiments to find an optimized architecture for a given task.
  5. Model Development: use code under models to access and modify ExoMiner architectures.

Source Data

All data used in this project are publicly available. Generally, the data used consist of:

  • TCE and Objects of Interest (e.g. KOI and TOI catalogs) tables available in archives/respositories such as NExSci, ExoFOP, TEV and MAST;
  • Light curve and target pixel FITS files created by the Kepler/TESS Missions, and data products generated by the Kepler/TESS Science Processing Operations Center (SPOC) available in archives such as the MAST.

Models

Models currently implemented in models:

  1. ExoMiner++ [CURRENT]: Improved architecture for TESS. Used in TESS paper (see References).

ExoMiner++ architecture.

References

For more detailed information see the following publications:

Credits

This work was developed by members of the Data Sciences Group, DASH, Intelligent Systems Division (Code-TI) at NASA Ames Research Center (NASA ARC).

  • Main Contributors

    • Hamed Valizadegan<sup>1,2</sup> (PI), hamed.valizadegan@nasa.gov
    • Miguel Martinho<sup>1,2</sup> (Co-I), miguel.martinho@nasa.gov
  • Collaborators

    • Active Collaborators

      • Doug Caldwell<sup>1,3</sup>
      • Jon Jenkins<sup>1,3</sup>
      • Joseph Twicken<sup>1,3</sup>
    • Past Collaborators

      • Stephen Bryson<sup>1</sup>
      • Jeff Smith<sup>1,3</sup>
  • Developers

    • Active Developers

    • Past Developers

      • Andrés Carranza <sup>2,5</sup> (Unfolded phase time series for transit signal classification)
      • Fellipe Marcellino<sup>2</sup> (Transit detection using Kepler data)
      • Jennifer Andersson<sup>4</sup> (Kepler to TESS transfer learning)
      • Kaylie Hausknecht<sup>2,6</sup> (Explainability framework using Kepler data)
      • Laurent Wilkens<sup>2</sup> (Kepler)
      • Martin Koeling<sup>4</sup> (Kepler to TESS transfer learning)
      • Nikash Walia<sup>2</sup> (Kepler)
      • Noa Lubin <sup>4</sup> (Kepler)
      • Pedro Gerum<sup>4</sup> (Kepler, Kepler non-TCE classification)
      • Patrick Maynard<sup>2,5,7</sup> (Kepler to TESS transfer learning)
      • Sam Donald<sup>4</sup> (Kepler to TESS transfer learning)
      • Theng Yang<sup>2,7</sup> (Label noise detection in Kepler data)
      • Hongbo Wei<sup>2,6,7</sup> (Kepler non-TCE classification, KOI classification, Kepler to TESS transfer learning)
      • Stuti Agarwal <sup>6</sup> (Difference image)
      • Joshua Belofsky <sup>2,5</sup> (Difference image)
      • Charles Yates <sup>2,5</sup> (Unfolded phase time series for transit signal classification, Kepler to TESS transfer learning)
      • William Zhong <sup>5</sup> (Difference Image)
      • Ashley Raigosa<sup>7</sup> (TESS SPOC FFI)
      • Saiswaroop Thammineni<sup>7</sup> (Transit Encoding)
      • Kunal Malhotra<sup>7</sup> (Transit Detection)
      • Eric Liang<sup>7</sup> (Transit Encoding)
      • Ujjawal Prasad<sup>8</sup> (Transit Detection)
      • Adithya Giri<sup>7</sup> (Brown Dwarfs vs Planets Classification; Structured and Adversarial Training for Transit Classification Robustness)
      • Josue Ochoa<sup>7</sup> (Transit Detection)
      • Aniket Mittal<sup>7</sup> (XAI for ExoMiner)

1 - NASA Ames Research Center (NASA ARC)
2 - KBR Inc.
3 - The SETI Institute
4 - NASA International Internship Program (NASA I<sup>2</sup>)
5 - NASA Internships, Fellowships & Scholarships (NIFS)
6 - Volunteer Internship Program (VIP)
7 - NASA Office of STEM Engagement (OSTEM)
8 - NASA-Chabot High School Learning Experience (NASA-CHSLE)

Acknowledgements

We would like to acknowledge people that in some way supported our efforts:

  • A big thank you to all the interns who dedicated their time to supporting the development of ExoMiner!
  • The SPOC team (Jon Jenkins, Joe Twicken, Doug Caldwell) at NASA Ames/SETI for their insight, knowledge, and support!
  • David Armstrong for an insightful discussion that improved our work.
  • Megan Ansdell for providing information on their code and work on Exonet.
  • The main contributors (HV and MM) are supported through TESS XRP 2022 contract 22-XRP22 2-0173, NASA Academic Services Mission (NAMS) contract number NNA16BD14C as well as the Intelligent Systems Research and Development 3 (ISRDS-3) Contract 80ARC020D0010.
  • Resources supporting this work were provided by the NASA High-End Computing (HEC) Program through the NASA Advanced Supercomputing (NAS) Division at Ames Research Center.
  • This work has made use of data products created by the Kepler/TESS Science Processing Operations Center (SPOC) pipeline at NASA Ames Research Center.
  • This work has made use of data from the European Space Agency (ESA) mission Gaia (https://www.cosmos.esa.int/gaia), processed by the Gaia Data Processing and Analysis Consortium (DPAC, https://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for the DPAC has been provided by national institutions, in particular the institutions participating in the Gaia Multilateral Agreement.
  • This work includes data collected by the TESS mission. Funding for the TESS mission is provided by the NASA's Science Mission Directorate.
  • This research is based on

Related Skills

View on GitHub
GitHub Stars63
CategoryData
Updated10d ago
Forks16

Languages

Python

Security Score

85/100

Audited on Mar 19, 2026

No findings