Next Article in Journal
Favourable Conditions for Majorana Phase Appearance in Neutrino Oscillation Probabilities
Previous Article in Journal
SBND Trigger System: Status and MTC/A Configuration
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Proceeding Paper

Measurement of Atmospheric Muon Neutrino Disappearance Using CNN Reconstructions with IceCube †

by
Shiqi Yu
on behalf of the IceCube Collaboration
Department of Physics and Astronomy, Michigan State University, East Lansing, MI 48824, USA
Presented at the 23rd International Workshop on Neutrinos from Accelerators, Salt Lake City, UT, USA, 30–31 July 2022.
Phys. Sci. Forum 2023, 8(1), 62; https://doi.org/10.3390/psf2023008062
Published: 8 October 2023

Abstract

:
The IceCube Neutrino Observatory is a Cherenkov detector located at the South Pole, instrumenting a cubic kilometer of ice. The DeepCore subdetector is located at the lower center of the IceCube array, and has denser configuration that has improved ability to see GeV-scale neutrinos in the detector. Convolutional neural networks (CNN) are used to reconstruct neutrino interactions in DeepCore, achieving comparable performance to the current likelihood-based method but with roughly 3000 times faster processing speeds. In this study, we present a preliminary atmospheric muon neutrino disappearance analysis using the CNN-reconstructed neutrino sample, and the sensitivity to neutrino oscillation parameter measurements is shown and compared to the recent IceCube results.

1. Introduction

The IceCube Neutrino Observatory is a Cherenkov detector located deep under the South Pole ice. It contains 5160 digital optical modules (DOMs), constituting 78 IceCube (IC) strings and 8 DeepCore (DC) strings (Figure 1) with 60 DOMs on each string. The DC strings are in the lower center of the detector.
When cosmic rays enter the atmosphere and interact with a nucleus, atmospheric neutrinos are produced. Those atmospheric neutrinos that interact in or near the IceCube detector produce charged particles. When these charged particles propagate in the ice faster than the speed of light, Cherenkov photons are emitted. The DOMs detect and convert the light signals into digitized waveforms. Using the charge and time information extracted from the waveforms of each DOM, convolutional neural networks (CNNs) are employed to reconstruct the neutrino interactions in the detector. The DOMs in the DC strings have higher quantum efficiency and are spatially denser than DOMs in the surrounding detector. With the DC detector, neutrino interactions can be measured at the GeV scale, providing excellent sensitivity to atmospheric neutrino oscillations and in particular to atmospheric ν μ disappearance.

2. Neutrino Oscillations

Neutrinos are produced and detected in their flavor states, i.e., electron ( ν e ), muon ( ν μ ), or tau ( ν τ ), while they propagate in their mass eigenstates, i.e., ν 1 , ν 2 , and ν 3 . After traveling, neutrinos can be detected in different flavor states from those they were produced in; this is called neutrino oscillation, and has been studied in many experiments [1,2,3,4,5,6]. The probabilities of neutrino oscillations can be described by a unitary matrix, the Pontecorvo–Maki–Nakagawa–Sakata (PMNS) matrix [7,8], as functions of the neutrino energy (E) and distance traveled (L). The PMNS matrix can be parameterized by three mixing angles, the squared-mass difference between neutrino mass states ( Δ m i j 2 with i , j = { 1 , 2 , 3 } ), and one CP-violating phase. In ν μ disappearance analysis, it is possible to measure the oscillation parameters θ 23 and Δ m 32 2 . The ν μ survival probability under two-flavor approximation is described by
P ( ν μ ν μ ) 1 sin 2 ( 2 θ 23 ) sin 2 1.27 Δ m 32 2 L E ,
and plotted as a function of cos ( θ zenith ) and E in Figure 2, where cos ( θ zenith ) is the neutrino arrival angle, which can be mapped into L. The probabilities of neutrino oscillations are functions of (L/E), which suggests that precise reconstruction is critical in neutrino oscillation measurements. CNNs are employed to reconstruct neutrino interactions in and near the DC detector using the digitized pulses of neutrino events.

3. Convolutional Neural Networks

CNNs have been broadly used in neutrino experiments for regression and classification purposes [9,10]. All of the DC strings along with the surrounding nineteen IC strings are used to learn the features of the neutrino interactions in and near the DC detector. Five summarized variables (the total charge, times of the first and last charges, time-weighted mean of the charge, and time-weighted standard deviation of the charge) are calculated from the digitized waveforms. Because the IC and DC strings are configured differently, they are fed to the CNN through two separate input layers, allowing the local features to be better learned and used by the CNN.

4. Reconstruction

4.1. Training

To build the ν μ disappearance analysis, it is necessary to reconstruct the neutrino interactions in the DC detector and select those neutrino events in the selected sample that are well reconstructed with a reasonably high signal-to-background ratio. We employ the CNN to reconstruct the neutrino energy, arrival angle ( θ zenith ), interaction vertex, particle identification (PID), and a classifier for cosmic ray muon rejection. The energy, θ zenith , and PID are used together as the analysis observables, while the event interaction vertex and muon classifier are respectively used to select the events in or near the DC detector and to further reduce the background muon rate. The PID can be inferred from the outgoing particle trace of the event in the detector. Most of the signal ( ν μ CC) events leave a long muon track in the detector, and as such are track-like. The other neutrino interactions, such as ν e CC, NC, and most ν τ CC, produce relatively short particle cascades, and are considered ’cascade-like’. For reconstruction purposes, different training samples are used to optimize the reconstruction performance.
As shown in Figure 3, for the energy, θ zenith , and interaction vertex we use Monte Carlo (MC) simulated ν μ CC events with a relatively flat distribution of the corresponding variable to train the CNN. In this way, the reconstruction performance is optimized on the signal events for the best reconstruction resolution while ensuring that the CNN reconstructions have reasonable performance on the background events ( ν e CC, ν τ CC, and NC events). The neutrino energy and interaction vertex is trained on the flat true energy ν μ CC sample, which has 7 million events in total, split into 80% for training and 20% for testing. The θ zenith is trained on a ν μ CC sample flattened in true θ zenith distribution. The PID is trained on a balanced mixture (50:50) of track ( ν μ CC) and cascade ( ν e CC and ν μ neutral-current) events with a total of 6 million events. The muon classifier is trained on approximately 4 million events with a combination of ν e CC (20%), ν μ CC (40%), and atmospheric muons (40%).

4.2. Testing

Performance is evaluated using the nominal MC ν μ CC and ν e CC events with flux, cross-section, and oscillation weights applied. The baseline benchmark is drawn using the current likelihood-based reconstruction [11].
As shown in Figure 4, the fractional error of energy reconstructed by the CNN has a smaller bias along the true neutrino energy compared to the likelihood-based method, especially in the low energy region; both methods show comparable performances on the reconstructed θ zenith , PID, and muon classifier. For the latter two, the area under curve (AUC) is used as the performance metric, where a larger AUC means a better performance. The reason for CNN’s better energy resolution in the low-energy region is that the focus is on the near DC region and it is optimized for low-energy events, which contribute the most to the oscillation sensitivities.
Figure 5 shows the purity of the selected samples by using the interaction vertex to select events starting inside or near the DC detector. The fraction of events selected by the CNN-reconstructed vertex that truly start in the DC region is similar to that of the likelihood-based method. For events that are far from the DC detector, the CNN has worse performance than the likelihood-based method, which is largely due to the information loss in the inputs to the two methods. Farther from the near-DC region, the likelihood-based method makes use of all photon information from the entire detector, meaning that it has more accurate estimates of the vertex position. However, because we have sufficiently large statistics in our data, the loss in uncontained events is unimportant to this analysis.

4.3. Processing Speed

With comparable or slightly better performance, CNN reconstruction runs approximately 3000 times faster than the likelihood-base method (Table 1) assuming the use of 1000 CPUs, and is even faster when running on GPUs. Faster processing speeds represent a major advantage for atmospheric neutrino datasets, where full MC simulation is approximately at the order of 10 8 events.

5. ν μ Disappearance Analysis

From the CNN reconstruction, we can construct the ν μ disappearance analysis to measure θ 23 and Δ m 32 2 . The energy, cos ( θ z e n i t h ) , and PID are used for analysis binning, as shown in Figure 6, providing better sensitivity to physics parameter measurements and robustness against systematic uncertainties. As shown in Figure 7, events with PIDs in the range of [0.55, 1] are placed in the track-like bin, events with PIDs in the range of [0, 0.25] are placed in the cascade-like bin, and the middle histogram (the mixed bin) contains the remaining events.
As shown in Figure 8, the projected sensitivity contour using CNN reconstruction shows significant improvement in comparison to the DeepCore 2021 result, and is at a level similar to using likelihood-based reconstruction. The projected sensitivity contour is compatible with previous experimental results [3,4,5,12].

6. Summary

In this analysis, we achieve comparable reconstruction performance to the likelihood-based method when employing CNNs, except with processing speeds up to 3000 times faster. When using the preliminary sample selected and binned by the CNN-reconstructed variables, the sensitivity to θ 23 and Δ m 32 2 measurement is been improved relative to the DeepCore 2021 results, and is comparable to using likelihood-based reconstruction. The CNN method can be adapted and applied to the future detector (the IceCube Upgrade [13]) to further improve the measurement precision of neutrino oscillation parameters.

Funding

The authors gratefully acknowledge the support from the following agencies and institutions: USA—U.S. National Science Foundation-Office of Polar Programs, U.S. National Science Foundation-Physics Division, U.S. National Science Foundation-EPSCoR, U.S. National Science Foundation AI Institute for Artificial Intelligence and Fundamental Interactions, U.S. National Science Foundation-Office of Advanced Cyberinfrastructure, Wisconsin Alumni Research Foundation, Center for High Throughput Computing (CHTC) at the University of Wisconsin–Madison, Open Science Grid (OSG), Partnership to Advance Throughput Computing (PATh), Advanced Cyberinfrastructure Coordination Ecosystem: Services & Support (ACCESS), Frontera computing project at the Texas Advanced Computing Center, U.S. Department of Energy-National Energy Research Scientific Computing Center, Particle astrophysics research computing center at the University of Maryland, Institute for Cyber-Enabled Research at Michigan State University, Astroparticle physics computational facility at Marquette University, NVIDIA Corporation, and Google Cloud Platform; Belgium—Funds for Scientific Research (FRS-FNRS and FWO), FWO Odysseus and Big Science programmes, and Belgian Federal Science Policy Office (Belspo); Germany—Bundesministerium für Bildung und Forschung (BMBF), Deutsche Forschungsgemeinschaft (DFG), Helmholtz Alliance for Astroparticle Physics (HAP), Initiative and Networking Fund of the Helmholtz Association, Deutsches Elektronen Synchrotron (DESY), and High Performance Computing cluster of the RWTH Aachen; Sweden—Swedish Research Council, Swedish Polar Research Secretariat, Swedish National Infrastructure for Computing (SNIC), and Knut and Alice Wallenberg Foundation; European Union—EGI Advanced Computing for research; Australia—Australian Research Council; Canada—Natural Sciences and Engineering Research Council of Canada, Calcul Québec, Compute Ontario, Canada Foundation for Innovation, WestGrid, and Digital Research Alliance of Canada; Denmark—Villum Fonden, Carlsberg Foundation, and European Commission; New Zealand—Marsden Fund; Japan—Japan Society for Promotion of Science (JSPS) and Institute for Global Prominent Research (IGPR) of Chiba University; Korea—National Research Foundation of Korea (NRF); Switzerland—Swiss National Science Foundation (SNSF).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Aartsen, M.G.; Ackermann, M.; Adams, J.; Aguilar, J.A.; Ahlers, M.; Ahrens, M.; Altmann, D.; Anderson, T.; Arguelles, C.; Arlen, T.C.; et al. Determining neutrino oscillation parameters from atmospheric muon neutrino disappearance with three years of IceCube DeepCore data. Phys. Rev. D 2015, 91, 072004. [Google Scholar] [CrossRef]
  2. The IceCube Collaboration. Measurement of Atmospheric Neutrino Oscillations at 6–56 GeV with IceCube DeepCore. Phys. Rev. Lett. 2018, 120, 071801. [Google Scholar] [CrossRef] [PubMed]
  3. Acero, M.A.; Adamson, P.; Aliaga, L.; Anfimov, N.; Antoshkin, A.; Arrieta-Diaz, E.; Asquith, L.; Aurisano, A.; Back, A.; Backhouse, C.; et al. The NOvA Collaboration, Improved measurement of neutrino oscillation parameters by the NOvA experiment. Phys. Rev. D 2022, 106, 032004. [Google Scholar] [CrossRef]
  4. Abe, K.; Akhlaq, N.; Akutsu, R.; Ali, A.; Alt, C.; Andreopoulos, C.; Antonova, M.; Aoki, S.; Arihara, T.; Asada, Y.; et al. The T2K Collaboration, Improved constraints on neutrino mixing from the T2K experiment with 3.13 × 1021 protons on target. Phys. Rev. D 2021, 103, 112008. [Google Scholar] [CrossRef]
  5. Adamson, P.; Anghel, I.; Aurisano, A.; Barr, G.; Blake, A.; Cao, S.V.; Carroll, T.J.; Castromonte, C.M.; Chen, R.; Childress, S.; et al. The MINOS+ Collaboration, Precision constraints for three-flavor neutrino oscillations from the full MINOS+ and MINOS data set. Phys. Rev. Lett. 2020, 125, 131802. [Google Scholar] [CrossRef] [PubMed]
  6. An, F.P.; Bai, W.D.; Balantekin, A.B.; Bishai, M.; Blyth, S.; Cao, G.F.; Cao, J.; Chang, J.F.; Chang, Y.; Chen, H.S.; et al. The Daya Bay Collaboration, Precision Measurement of Reactor Antineutrino Oscillation at Kilometer-Scale Baselines by Daya Bay. Phys. Rev. Lett. 2023, 130, 161802. [Google Scholar] [CrossRef] [PubMed]
  7. Pontecorvo, B. Mesonium and anti-mesonium. Sov. Phys. JETP 1957, 6, 429. [Google Scholar]
  8. Maki, Z.; Nakagawa, M.; Sakata, S. Remarks on the unified model of elementary particles. Prog. Theor. Phys. 1962, 28, 870. [Google Scholar] [CrossRef]
  9. Aurisano, A.; Radovic, A.; Rocco, D.; Himmel, A.; Messier, M.D.; Niner, E.; Pawloski, G.; Psihas, F.; Sousa, A.; Vahle, P. A Convolutional Neural Network Neutrino Event Classifier. J. Instrum. 2016, 11, P09001. [Google Scholar] [CrossRef]
  10. Huennefeld, M. Reconstruction Techniques in IceCube using Convolutional and Generative Neural Networks. Epj Web Conf. 2019, 207, 05005. [Google Scholar] [CrossRef]
  11. Abbasi, R.; Ackermann, M.; Adams, J.; Aguilar, J.A.; Ahlers, M.; Ahrens, M.; Alameddine, J.M.; Alves, A.A., Jr.; Amin, N.M.; Andeen, K.; et al. The IceCube Collaboration, Low Energy Event Reconstruction in IceCube DeepCore. Eur. Phys. J. C 2022, 82, 807. [Google Scholar] [CrossRef]
  12. The Super-Kamiokande Collaboration. Recent results and future prospects from Super-Kamiokande. In Proceedings of the XXIX International Conference on Neutrino Physics and Astrophysics (NEUTRINO2020), Online, 22 June–2 July 2020. [Google Scholar]
  13. Aya Ishihara for the IceCube Collaboration. The IceCube Upgrade-Design and Science Goals. arXiv 2019, arXiv:1908.09441. [Google Scholar]
Figure 1. IceCube detector (left) and top view of detector strings (right) with eight DC strings (red filled).
Figure 1. IceCube detector (left) and top view of detector strings (right) with eight DC strings (red filled).
Psf 08 00062 g001
Figure 2. The ν μ disappearance probability as a function of the energy and cos ( θ zenith ) .
Figure 2. The ν μ disappearance probability as a function of the energy and cos ( θ zenith ) .
Psf 08 00062 g002
Figure 3. Training samples for energy (left) and θ zenith (right).
Figure 3. Training samples for energy (left) and θ zenith (right).
Psf 08 00062 g003
Figure 4. Performance comparison of CNN (blue) and likelihood-based (orange) on reconstruction of (from top left to bottom right) the energy, θ zenith , PID, and muon classifier, where random represents the baseline performance of particles being randomly assigned to different categories.
Figure 4. Performance comparison of CNN (blue) and likelihood-based (orange) on reconstruction of (from top left to bottom right) the energy, θ zenith , PID, and muon classifier, where random represents the baseline performance of particles being randomly assigned to different categories.
Psf 08 00062 g004
Figure 5. Selected sample purity when using CNN (left) and likelihood (right) methods reconstructed interaction vertex; the rows sum to 1, while the numbers corresponds to the fraction of unweighted events.
Figure 5. Selected sample purity when using CNN (left) and likelihood (right) methods reconstructed interaction vertex; the rows sum to 1, while the numbers corresponds to the fraction of unweighted events.
Psf 08 00062 g005
Figure 6. Preliminary sample in cascade (left), mixed (middle), and track (right) bins with flux, cross-section, and oscillation weights applied; the pink circle highlights the ν μ disappearance “valley”.
Figure 6. Preliminary sample in cascade (left), mixed (middle), and track (right) bins with flux, cross-section, and oscillation weights applied; the pink circle highlights the ν μ disappearance “valley”.
Psf 08 00062 g006
Figure 7. MC distribution of CNN-reconstructed PIDs with stacked interaction types (colors) and boundaries used for binning (dashed).
Figure 7. MC distribution of CNN-reconstructed PIDs with stacked interaction types (colors) and boundaries used for binning (dashed).
Psf 08 00062 g007
Figure 8. Preliminary sensitivity using CNN reconstruction (black) projected from DeepCore 2021 result (yellow) compared to using likelihood-based reconstruction (red).
Figure 8. Preliminary sensitivity using CNN reconstruction (black) projected from DeepCore 2021 result (yellow) compared to using likelihood-based reconstruction (red).
Psf 08 00062 g008
Table 1. Averaged processing speed of the CNN and likelihood-based methods assuming 1000 cores.
Table 1. Averaged processing speed of the CNN and likelihood-based methods assuming 1000 cores.
Time per 3k Events (s)Full Sample (1000 Cores)
CNN on GPU21 13 min
CNN on CPU45 7.5 h
Likelihood-based on CPU120,000 46 days
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yu, S., on behalf of the IceCube Collaboration. Measurement of Atmospheric Muon Neutrino Disappearance Using CNN Reconstructions with IceCube. Phys. Sci. Forum 2023, 8, 62. https://doi.org/10.3390/psf2023008062

AMA Style

Yu S on behalf of the IceCube Collaboration. Measurement of Atmospheric Muon Neutrino Disappearance Using CNN Reconstructions with IceCube. Physical Sciences Forum. 2023; 8(1):62. https://doi.org/10.3390/psf2023008062

Chicago/Turabian Style

Yu, Shiqi on behalf of the IceCube Collaboration. 2023. "Measurement of Atmospheric Muon Neutrino Disappearance Using CNN Reconstructions with IceCube" Physical Sciences Forum 8, no. 1: 62. https://doi.org/10.3390/psf2023008062

Article Metrics

Back to TopTop