HOW IT WORK?

1. Initial Data and Purpose

The model is based on light curve data from exoplanets obtained from missions like Kepler and TESS. The light curve shows how a star’s brightness changes over time. When a planet passes in front of its star from our perspective, it causes a small decrease in brightness, creating a dip in the light curve. This phenomenon is used to identify potential exoplanets.

2. Metadata and Data Preprocessing

To train a machine learning model, it’s necessary to have structured and processed data. In this case, the following essential data is gathered for each star to be analyzed:

The preprocessing of this data involves several stages, such as:

Generation of two views:

3. Machine Learning Model

With preprocessed data, the machine learning model is trained using a set of convolutional neural networks (CNNs), which are highly effective for processing and analyzing images or time-sequence data. The model takes two "numerical images," corresponding to the global and local views of the light curve, and looks for patterns within them that indicate the presence of a planetary transit.

Training steps include:

4. Model Evaluation

During training, metrics such as AUC (Area Under the ROC Curve), Precision, and Recall are used to evaluate the model's performance. Precision measures how many of the positive predictions are actually correct, while recall measures how many of the true events (transits) the model successfully detects.

Additionally, the decision threshold is adjusted to optimize the balance between precision and recall. Depending on the project's objectives, recall can be prioritized to ensure fewer exoplanets are missed, although this might increase the number of false positives.

5. Model Application

Once trained and validated, the model can be used to predict the probability that new light curves correspond to a planetary transit. These models can process large amounts of data from missions like Kepler and TESS, providing an automated and efficient analysis.

6. Considerations for Identifying Exoplanets

The model doesn’t just rely on the presence of a dip in brightness to identify a transit, but also on additional characteristics that help differentiate between real transits and false positives, such as: