CINXE.COM

Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches

<!DOCTYPE html> <html lang="en"> <head> <meta content="text/html; charset=utf-8" http-equiv="content-type"/> <title>Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches</title> <!--Generated on Tue Sep 10 08:27:45 2024 by LaTeXML (version 0.8.8) http://dlmf.nist.gov/LaTeXML/.--> <meta content="width=device-width, initial-scale=1, shrink-to-fit=no" name="viewport"/> <link href="https://cdn.jsdelivr.net/npm/bootstrap@5.3.0/dist/css/bootstrap.min.css" rel="stylesheet" type="text/css"/> <link href="/static/browse/0.3.4/css/ar5iv.0.7.9.min.css" rel="stylesheet" type="text/css"/> <link href="/static/browse/0.3.4/css/ar5iv-fonts.0.7.9.min.css" rel="stylesheet" type="text/css"/> <link href="/static/browse/0.3.4/css/latexml_styles.css" rel="stylesheet" type="text/css"/> <script src="https://cdn.jsdelivr.net/npm/bootstrap@5.3.0/dist/js/bootstrap.bundle.min.js"></script> <script src="https://cdnjs.cloudflare.com/ajax/libs/html2canvas/1.3.3/html2canvas.min.js"></script> <script src="/static/browse/0.3.4/js/addons_new.js"></script> <script src="/static/browse/0.3.4/js/feedbackOverlay.js"></script> <base href="/html/2409.06327v1/"/></head> <body> <nav class="ltx_page_navbar"> <nav class="ltx_TOC"> <ol class="ltx_toclist"> <li class="ltx_tocentry ltx_tocentry_section"><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S1" title="In Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">1 </span>Introduction</span></a></li> <li class="ltx_tocentry ltx_tocentry_section"> <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S2" title="In Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">2 </span>Vulnerability of contemporary ASV model to simultaneous threats</span></a> <ol class="ltx_toclist ltx_toclist_section"> <li class="ltx_tocentry ltx_tocentry_subsection"><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S2.SS1" title="In 2 Vulnerability of contemporary ASV model to simultaneous threats ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">2.1 </span>Testing dataset</span></a></li> <li class="ltx_tocentry ltx_tocentry_subsection"><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S2.SS2" title="In 2 Vulnerability of contemporary ASV model to simultaneous threats ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">2.2 </span>Preliminary experimental result of contemporary ASV model</span></a></li> </ol> </li> <li class="ltx_tocentry ltx_tocentry_section"> <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3" title="In Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">3 </span>Proposed integrated approach</span></a> <ol class="ltx_toclist ltx_toclist_section"> <li class="ltx_tocentry ltx_tocentry_subsection"><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.SS1" title="In 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">3.1 </span>Overview</span></a></li> <li class="ltx_tocentry ltx_tocentry_subsection"> <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.SS2" title="In 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">3.2 </span>Model architecture and objectives</span></a> <ol class="ltx_toclist ltx_toclist_subsection"> <li class="ltx_tocentry ltx_tocentry_subsubsection"><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.SS2.SSS1" title="In 3.2 Model architecture and objectives ‣ 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">3.2.1 </span>Asymmetric dual-path feature extractor</span></a></li> <li class="ltx_tocentry ltx_tocentry_subsubsection"><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.SS2.SSS2" title="In 3.2 Model architecture and objectives ‣ 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">3.2.2 </span>Speaker classifier and spoof classifier</span></a></li> <li class="ltx_tocentry ltx_tocentry_subsubsection"><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.SS2.SSS3" title="In 3.2 Model architecture and objectives ‣ 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">3.2.3 </span>SASV binary classifier</span></a></li> </ol> </li> <li class="ltx_tocentry ltx_tocentry_subsection"> <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.SS3" title="In 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">3.3 </span>Learning paradigm</span></a> <ol class="ltx_toclist ltx_toclist_subsection"> <li class="ltx_tocentry ltx_tocentry_subsubsection"><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.SS3.SSS1" title="In 3.3 Learning paradigm ‣ 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">3.3.1 </span>Outer loop</span></a></li> <li class="ltx_tocentry ltx_tocentry_subsubsection"><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.SS3.SSS2" title="In 3.3 Learning paradigm ‣ 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">3.3.2 </span>Inner loop</span></a></li> </ol> </li> </ol> </li> <li class="ltx_tocentry ltx_tocentry_section"> <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S4" title="In Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">4 </span>Experimental results</span></a> <ol class="ltx_toclist ltx_toclist_section"> <li class="ltx_tocentry ltx_tocentry_subsection"><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S4.SS1" title="In 4 Experimental results ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">4.1 </span>Dataset</span></a></li> <li class="ltx_tocentry ltx_tocentry_subsection"><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S4.SS2" title="In 4 Experimental results ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">4.2 </span>Experimental settings</span></a></li> <li class="ltx_tocentry ltx_tocentry_subsection"> <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S4.SS3" title="In 4 Experimental results ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">4.3 </span>Results and analysis</span></a> <ol class="ltx_toclist ltx_toclist_subsection"> <li class="ltx_tocentry ltx_tocentry_subsubsection"><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S4.SS3.SSS1" title="In 4.3 Results and analysis ‣ 4 Experimental results ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">4.3.1 </span>Result for channel mismatch scenario</span></a></li> <li class="ltx_tocentry ltx_tocentry_subsubsection"><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S4.SS3.SSS2" title="In 4.3 Results and analysis ‣ 4 Experimental results ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">4.3.2 </span>Result for spoofing attack scenario</span></a></li> <li class="ltx_tocentry ltx_tocentry_subsubsection"><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S4.SS3.SSS3" title="In 4.3 Results and analysis ‣ 4 Experimental results ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">4.3.3 </span>Result for domain mismatch scenario</span></a></li> </ol> </li> </ol> </li> <li class="ltx_tocentry ltx_tocentry_section"><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S5" title="In Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">5 </span>Conclusion</span></a></li> <li class="ltx_tocentry ltx_tocentry_section"><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S6" title="In Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_title"><span class="ltx_tag ltx_tag_ref">6 </span>ACKNOWLEDGMENTS</span></a></li> </ol></nav> </nav> <div class="ltx_page_main"> <div class="ltx_page_content"> <article class="ltx_document ltx_authors_1line"> <h1 class="ltx_title ltx_title_document">Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches</h1> <div class="ltx_abstract"> <h6 class="ltx_title ltx_title_abstract">Abstract</h6> <p class="ltx_p" id="id1.id1"><span class="ltx_text" id="id1.id1.1" style="font-size:90%;">In real-world applications, it is challenging to build a speaker verification system that is simultaneously robust against common threats, including spoofing attacks, channel mismatch, and domain mismatch. Traditional automatic speaker verification (ASV) systems often tackle these issues separately, leading to suboptimal performance when faced with simultaneous challenges. In this paper, we propose an integrated framework that incorporates pair-wise learning and spoofing attack simulation into the meta-learning paradigm to enhance robustness against these multifaceted threats. This novel approach employs an asymmetric dual-path model and a multi-task learning strategy to handle ASV, anti-spoofing, and spoofing-aware ASV tasks concurrently. A new testing dataset, CNComplex, is introduced to evaluate system performance under these combined threats. Experimental results demonstrate that our integrated model significantly improves performance over traditional ASV systems across various scenarios, showcasing its potential for real-world deployment. Additionally, the proposed framework’s ability to generalize across different conditions highlights its robustness and reliability, making it a promising solution for practical ASV applications.</span></p> </div> <div class="ltx_para" id="p1"> <p class="ltx_p" id="p1.1"><span class="ltx_text ltx_font_bold ltx_font_italic" id="p1.1.1" style="font-size:90%;">Index Terms<span class="ltx_text ltx_font_upright" id="p1.1.1.1">— <span class="ltx_text ltx_font_medium" id="p1.1.1.1.1"> Speaker verification, robustness, multi-task learning, meta-learning</span></span></span></p> </div> <section class="ltx_section" id="S1"> <h2 class="ltx_title ltx_title_section" style="font-size:90%;"> <span class="ltx_tag ltx_tag_section">1 </span>Introduction</h2> <div class="ltx_para" id="S1.p1"> <p class="ltx_p" id="S1.p1.1"><span class="ltx_text" id="S1.p1.1.1" style="font-size:90%;">Automatic speaker verification (ASV) aims to accurately identify the speaker’s identity by analyzing voice characteristics within speech signals, providing robust support for personalized services. Mainstream ASV solutions have evolved from traditional statistics-based methods such as GMM-UBM </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S1.p1.1.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib1" title="">1</a><span class="ltx_text" id="S1.p1.1.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S1.p1.1.4" style="font-size:90%;"> and I-Vector </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S1.p1.1.5.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib2" title="">2</a><span class="ltx_text" id="S1.p1.1.6.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S1.p1.1.7" style="font-size:90%;"> to deep neural network-based approaches </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S1.p1.1.8.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib3" title="">3</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib4" title="">4</a><span class="ltx_text" id="S1.p1.1.9.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S1.p1.1.10" style="font-size:90%;">. For example, time-delay neural network (TDNN) </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S1.p1.1.11.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib5" title="">5</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib6" title="">6</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib7" title="">7</a><span class="ltx_text" id="S1.p1.1.12.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S1.p1.1.13" style="font-size:90%;"> and ResNet </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S1.p1.1.14.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib8" title="">8</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib9" title="">9</a><span class="ltx_text" id="S1.p1.1.15.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S1.p1.1.16" style="font-size:90%;"> have dominated the realm of extracting speaker embeddings since they can efficiently leverage large amounts of speech data to capture the properties of speakers accurately. Additionally, the back-end is another important component of an ASV system. Neural network-based methods </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S1.p1.1.17.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib10" title="">10</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib11" title="">11</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib12" title="">12</a><span class="ltx_text" id="S1.p1.1.18.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S1.p1.1.19" style="font-size:90%;"> have gradually replaced the widely used PLDA model in recent years.</span></p> </div> <div class="ltx_para" id="S1.p2"> <p class="ltx_p" id="S1.p2.1"><span class="ltx_text" id="S1.p2.1.1" style="font-size:90%;">However, as speaker recognition has rapidly evolved, its focus has shifted from improving performance on simple datasets such as TIMIT and LibriSpeech </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S1.p2.1.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib13" title="">13</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib14" title="">14</a><span class="ltx_text" id="S1.p2.1.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S1.p2.1.4" style="font-size:90%;"> to addressing challenges in what is often referred to as “recognition in the wild” </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S1.p2.1.5.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib15" title="">15</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib16" title="">16</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib17" title="">17</a><span class="ltx_text" id="S1.p2.1.6.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S1.p2.1.7" style="font-size:90%;">. In this context, numerous issues have surfaced, including channel mismatch </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S1.p2.1.8.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib18" title="">18</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib19" title="">19</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib20" title="">20</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib21" title="">21</a><span class="ltx_text" id="S1.p2.1.9.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S1.p2.1.10" style="font-size:90%;">, vulnerability to spoofing attacks </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S1.p2.1.11.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib22" title="">22</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib23" title="">23</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib24" title="">24</a><span class="ltx_text" id="S1.p2.1.12.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S1.p2.1.13" style="font-size:90%;">, and domain mismatches </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S1.p2.1.14.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib25" title="">25</a><span class="ltx_text" id="S1.p2.1.15.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S1.p2.1.16" style="font-size:90%;">, as shown in Figure </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S1.F1" style="font-size:90%;" title="Figure 1 ‣ 1 Introduction ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">1</span></a><span class="ltx_text" id="S1.p2.1.17" style="font-size:90%;">. While substantial progress has been made in addressing each of these challenges individually </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S1.p2.1.18.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib26" title="">26</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib27" title="">27</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib28" title="">28</a><span class="ltx_text" id="S1.p2.1.19.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S1.p2.1.20" style="font-size:90%;">, the practical deployment of ASV systems demands solutions that can effectively handle these multifaceted threats. Given the potential for the ASV system to encounter simultaneous attacks, there is a critical need to amalgamate all these functionalities within the ASV system itself.</span></p> </div> <figure class="ltx_figure" id="S1.F1"><img alt="Refer to caption" class="ltx_graphics ltx_centering ltx_img_landscape" height="116" id="S1.F1.g1" src="x1.png" width="436"/> <figcaption class="ltx_caption ltx_centering" style="font-size:90%;"><span class="ltx_tag ltx_tag_figure"><span class="ltx_text ltx_font_bold" id="S1.F1.4.1.1">Fig. 1</span>: </span>The primary threats to ASV systems. These threats can be categorized into channel mismatch, spoofing attacks, and domain mismatch.</figcaption> </figure> <div class="ltx_para" id="S1.p3"> <p class="ltx_p" id="S1.p3.1"><span class="ltx_text" id="S1.p3.1.1" style="font-size:90%;">However, the integration is hindered by two factors.</span></p> <ol class="ltx_enumerate" id="S1.I1"> <li class="ltx_item" id="S1.I1.i1" style="list-style-type:none;"> <span class="ltx_tag ltx_tag_item">1.</span> <div class="ltx_para" id="S1.I1.i1.p1"> <p class="ltx_p" id="S1.I1.i1.p1.1"><span class="ltx_text" id="S1.I1.i1.p1.1.1" style="font-size:90%;">Presently, no dataset encompasses the presence of three concurrent threats. For instance, widely used datasets for speaker verification such as VoxCeleb 1&amp;2 </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S1.I1.i1.p1.1.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib15" title="">15</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib29" title="">29</a><span class="ltx_text" id="S1.I1.i1.p1.1.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S1.I1.i1.p1.1.4" style="font-size:90%;"> and CNCeleb 1&amp;2 </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S1.I1.i1.p1.1.5.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib17" title="">17</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib30" title="">30</a><span class="ltx_text" id="S1.I1.i1.p1.1.6.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S1.I1.i1.p1.1.7" style="font-size:90%;"> exclusively feature scenarios involving channel mismatch. Meanwhile, datasets designed for the ASVspoof challenge series </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S1.I1.i1.p1.1.8.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib22" title="">22</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib23" title="">23</a><span class="ltx_text" id="S1.I1.i1.p1.1.9.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S1.I1.i1.p1.1.10" style="font-size:90%;"> solely tackle spoofing attacks. Hence, it is imperative to develop a new dataset and evaluation protocol encompassing all three considerations.</span></p> </div> </li> <li class="ltx_item" id="S1.I1.i2" style="list-style-type:none;"> <span class="ltx_tag ltx_tag_item">2.</span> <div class="ltx_para" id="S1.I1.i2.p1"> <p class="ltx_p" id="S1.I1.i2.p1.1"><span class="ltx_text" id="S1.I1.i2.p1.1.1" style="font-size:90%;">One apparent method to incorporate all functionalities involves cascading individual components, each dedicated to addressing a specific challenge. For instance, positioning the countermeasures module ahead of the ASV module serves to detect synthesized input utterances. Additionally, regarding the ASV module as comprising a neural speaker encoder and a back-end, assigning domain mismatch handling to the former and channel mismatch to the latter seems intuitive. However, as discussed in </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S1.I1.i2.p1.1.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib27" title="">27</a><span class="ltx_text" id="S1.I1.i2.p1.1.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S1.I1.i2.p1.1.4" style="font-size:90%;">, this cascading approach overlooks the interactions among these modules, leading to suboptimal system performance. Consequently, a novel framework must be devised to facilitate and leverage these interactions effectively.</span></p> </div> </li> </ol> </div> <div class="ltx_para" id="S1.p4"> <p class="ltx_p" id="S1.p4.1"><span class="ltx_text" id="S1.p4.1.1" style="font-size:90%;">In response to this practical need, in this paper we introduce an integrated framework that harmonizes the techniques elucidated in previous studies </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S1.p4.1.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib11" title="">11</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib27" title="">27</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib28" title="">28</a><span class="ltx_text" id="S1.p4.1.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S1.p4.1.4" style="font-size:90%;">. This includes the pair-wise learning paradigm, which simulates channel mismatch, the fusion module, which replicates spoofing attacks, and the meta-learning paradigm, which mimics domain mismatch. The aim is to establish a robust defense against channel variability, spoofing attacks, and domain mismatch, all in one comprehensive approach. This represents an end-to-end methodology, optimizing resilience across these challenges by utilizing a shared model. To the best of our knowledge, this is the first attempt to integrate robustness into a single model by explicitly exposing the model to all threats. The primary contribution lies in amalgamating the pair-wise learning, spoofing simulation, and meta-learning paradigms into a unified optimization strategy.</span></p> </div> <div class="ltx_para" id="S1.p5"> <p class="ltx_p" id="S1.p5.1"><span class="ltx_text" id="S1.p5.1.1" style="font-size:90%;">The subsequent sections of this paper are structured as follows. We commence with a preliminary experiment with the ECAPA-TDNN model on a new spoofing-aware ASV evaluation dataset, which contains testing trials for both channel and domain mismatches, offering evidence of the vulnerability of contemporary SOTA ASV models when facing three threats simultaneously. Next, an overview of the proposed integrated approach is given in Section </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3" style="font-size:90%;" title="3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">3</span></a><span class="ltx_text" id="S1.p5.1.2" style="font-size:90%;">, including the overview of it to offer insights into the integration methodology in Section </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.SS1" style="font-size:90%;" title="3.1 Overview ‣ 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">3.1</span></a><span class="ltx_text" id="S1.p5.1.3" style="font-size:90%;">, the specifics of the model architecture and its objectives in Section </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.SS2" style="font-size:90%;" title="3.2 Model architecture and objectives ‣ 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">3.2</span></a><span class="ltx_text" id="S1.p5.1.4" style="font-size:90%;">, and the learning paradigm for E2E optimization in Section </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.SS3" style="font-size:90%;" title="3.3 Learning paradigm ‣ 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">3.3</span></a><span class="ltx_text" id="S1.p5.1.5" style="font-size:90%;">. The experimental setup, along with the results and their analysis, is detailed in Section </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S4" style="font-size:90%;" title="4 Experimental results ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">4</span></a><span class="ltx_text" id="S1.p5.1.6" style="font-size:90%;">. Section </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S5" style="font-size:90%;" title="5 Conclusion ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">5</span></a><span class="ltx_text" id="S1.p5.1.7" style="font-size:90%;"> concludes the paper with a comprehensive summary.</span></p> </div> </section> <section class="ltx_section" id="S2"> <h2 class="ltx_title ltx_title_section" style="font-size:90%;"> <span class="ltx_tag ltx_tag_section">2 </span>Vulnerability of contemporary ASV model to simultaneous threats</h2> <figure class="ltx_table" id="S2.T1"> <figcaption class="ltx_caption" style="font-size:90%;"><span class="ltx_tag ltx_tag_table"><span class="ltx_text ltx_font_bold" id="S2.T1.4.1.1">Table 1</span>: </span>Genre group division. Ten genres are randomly divided into four groups.</figcaption> <table class="ltx_tabular ltx_centering ltx_guessed_headers ltx_align_middle" id="S2.T1.5"> <thead class="ltx_thead"> <tr class="ltx_tr" id="S2.T1.5.1.1"> <th class="ltx_td ltx_align_left ltx_th ltx_th_column ltx_th_row ltx_border_tt" id="S2.T1.5.1.1.1"><span class="ltx_text ltx_font_bold" id="S2.T1.5.1.1.1.1" style="font-size:90%;">Group</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt" id="S2.T1.5.1.1.2"><span class="ltx_text ltx_font_bold" id="S2.T1.5.1.1.2.1" style="font-size:90%;">Genre Types</span></th> </tr> </thead> <tbody class="ltx_tbody"> <tr class="ltx_tr" id="S2.T1.5.2.1"> <th class="ltx_td ltx_align_left ltx_th ltx_th_row ltx_border_t" id="S2.T1.5.2.1.1"><span class="ltx_text" id="S2.T1.5.2.1.1.1" style="font-size:90%;">Group I</span></th> <td class="ltx_td ltx_align_center ltx_border_t" id="S2.T1.5.2.1.2"><span class="ltx_text" id="S2.T1.5.2.1.2.1" style="font-size:90%;">drama (dr), vlog (vl), speech (sp)</span></td> </tr> <tr class="ltx_tr" id="S2.T1.5.3.2"> <th class="ltx_td ltx_align_left ltx_th ltx_th_row ltx_border_t" id="S2.T1.5.3.2.1"><span class="ltx_text" id="S2.T1.5.3.2.1.1" style="font-size:90%;">Group II</span></th> <td class="ltx_td ltx_align_center ltx_border_t" id="S2.T1.5.3.2.2"><span class="ltx_text" id="S2.T1.5.3.2.2.1" style="font-size:90%;">entertainment (en), interview (in), play (pl)</span></td> </tr> <tr class="ltx_tr" id="S2.T1.5.4.3"> <th class="ltx_td ltx_align_left ltx_th ltx_th_row ltx_border_t" id="S2.T1.5.4.3.1"><span class="ltx_text" id="S2.T1.5.4.3.1.1" style="font-size:90%;">Group III</span></th> <td class="ltx_td ltx_align_center ltx_border_t" id="S2.T1.5.4.3.2"><span class="ltx_text" id="S2.T1.5.4.3.2.1" style="font-size:90%;">live broadcast (lb), movie (mo)</span></td> </tr> <tr class="ltx_tr" id="S2.T1.5.5.4"> <th class="ltx_td ltx_align_left ltx_th ltx_th_row ltx_border_bb ltx_border_t" id="S2.T1.5.5.4.1"><span class="ltx_text" id="S2.T1.5.5.4.1.1" style="font-size:90%;">Group IV</span></th> <td class="ltx_td ltx_align_center ltx_border_bb ltx_border_t" id="S2.T1.5.5.4.2"><span class="ltx_text" id="S2.T1.5.5.4.2.1" style="font-size:90%;">singing (si), recitation (re)</span></td> </tr> </tbody> </table> </figure> <figure class="ltx_table" id="S2.T2"> <figcaption class="ltx_caption" style="font-size:90%;"><span class="ltx_tag ltx_tag_table"><span class="ltx_text ltx_font_bold" id="S2.T2.4.1.1">Table 2</span>: </span>Cross-genre protocols (CGPs) established via K-fold validation. For each protocol, the training dataset only contains seen genre groups, while the testing dataset contains both seen and unseen genre groups. The unseen genre in the testing dataset results in a cross-genre scenario. Note that speakers in the training and testing datasets overlap.</figcaption> <table class="ltx_tabular ltx_centering ltx_guessed_headers ltx_align_middle" id="S2.T2.5"> <thead class="ltx_thead"> <tr class="ltx_tr" id="S2.T2.5.1.1"> <th class="ltx_td ltx_align_left ltx_th ltx_th_column ltx_th_row ltx_border_tt" id="S2.T2.5.1.1.1" style="padding-left:4.0pt;padding-right:4.0pt;"><span class="ltx_text ltx_font_bold" id="S2.T2.5.1.1.1.1" style="font-size:90%;">CGP</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt" id="S2.T2.5.1.1.2" style="padding-left:4.0pt;padding-right:4.0pt;"><span class="ltx_text ltx_font_bold" id="S2.T2.5.1.1.2.1" style="font-size:90%;">Seen Genres</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt" id="S2.T2.5.1.1.3" style="padding-left:4.0pt;padding-right:4.0pt;"><span class="ltx_text ltx_font_bold" id="S2.T2.5.1.1.3.1" style="font-size:90%;">Unseen Genres</span></th> </tr> </thead> <tbody class="ltx_tbody"> <tr class="ltx_tr" id="S2.T2.5.2.1"> <th class="ltx_td ltx_align_left ltx_th ltx_th_row ltx_border_t" id="S2.T2.5.2.1.1" style="padding-left:4.0pt;padding-right:4.0pt;"><span class="ltx_text" id="S2.T2.5.2.1.1.1" style="font-size:90%;">CGP I</span></th> <td class="ltx_td ltx_align_center ltx_border_t" id="S2.T2.5.2.1.2" style="padding-left:4.0pt;padding-right:4.0pt;"><span class="ltx_text" id="S2.T2.5.2.1.2.1" style="font-size:90%;">Group I, Group II, Group III</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S2.T2.5.2.1.3" style="padding-left:4.0pt;padding-right:4.0pt;"><span class="ltx_text" id="S2.T2.5.2.1.3.1" style="font-size:90%;">Group IV</span></td> </tr> <tr class="ltx_tr" id="S2.T2.5.3.2"> <th class="ltx_td ltx_align_left ltx_th ltx_th_row ltx_border_t" id="S2.T2.5.3.2.1" style="padding-left:4.0pt;padding-right:4.0pt;"><span class="ltx_text" id="S2.T2.5.3.2.1.1" style="font-size:90%;">CGP II</span></th> <td class="ltx_td ltx_align_center ltx_border_t" id="S2.T2.5.3.2.2" style="padding-left:4.0pt;padding-right:4.0pt;"><span class="ltx_text" id="S2.T2.5.3.2.2.1" style="font-size:90%;">Group I, Group II, Group IV</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S2.T2.5.3.2.3" style="padding-left:4.0pt;padding-right:4.0pt;"><span class="ltx_text" id="S2.T2.5.3.2.3.1" style="font-size:90%;">Group III</span></td> </tr> <tr class="ltx_tr" id="S2.T2.5.4.3"> <th class="ltx_td ltx_align_left ltx_th ltx_th_row ltx_border_t" id="S2.T2.5.4.3.1" style="padding-left:4.0pt;padding-right:4.0pt;"><span class="ltx_text" id="S2.T2.5.4.3.1.1" style="font-size:90%;">CGP III</span></th> <td class="ltx_td ltx_align_center ltx_border_t" id="S2.T2.5.4.3.2" style="padding-left:4.0pt;padding-right:4.0pt;"><span class="ltx_text" id="S2.T2.5.4.3.2.1" style="font-size:90%;">Group I, Group III, Group IV</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S2.T2.5.4.3.3" style="padding-left:4.0pt;padding-right:4.0pt;"><span class="ltx_text" id="S2.T2.5.4.3.3.1" style="font-size:90%;">Group II</span></td> </tr> <tr class="ltx_tr" id="S2.T2.5.5.4"> <th class="ltx_td ltx_align_left ltx_th ltx_th_row ltx_border_bb ltx_border_t" id="S2.T2.5.5.4.1" style="padding-left:4.0pt;padding-right:4.0pt;"><span class="ltx_text" id="S2.T2.5.5.4.1.1" style="font-size:90%;">CGP IV</span></th> <td class="ltx_td ltx_align_center ltx_border_bb ltx_border_t" id="S2.T2.5.5.4.2" style="padding-left:4.0pt;padding-right:4.0pt;"><span class="ltx_text" id="S2.T2.5.5.4.2.1" style="font-size:90%;">Group II, Group III, Group IV</span></td> <td class="ltx_td ltx_align_center ltx_border_bb ltx_border_t" id="S2.T2.5.5.4.3" style="padding-left:4.0pt;padding-right:4.0pt;"><span class="ltx_text" id="S2.T2.5.5.4.3.1" style="font-size:90%;">Group I</span></td> </tr> </tbody> </table> </figure> <section class="ltx_subsection" id="S2.SS1"> <h3 class="ltx_title ltx_title_subsection" style="font-size:90%;"> <span class="ltx_tag ltx_tag_subsection">2.1 </span>Testing dataset</h3> <div class="ltx_para" id="S2.SS1.p1"> <p class="ltx_p" id="S2.SS1.p1.1"><span class="ltx_text" id="S2.SS1.p1.1.1" style="font-size:90%;">In light of the absence of an existing dataset that aligns with requirements for evaluating system performance in the challenging scenario involving all three threats simultaneously, it is imperative to create a new testing dataset and devise a corresponding evaluation protocol tailored to this scenario. This newly developed test dataset was constructed based on the original CNCeleb test dataset, as CNCeleb has multiple genres, making it well-suited as a basis for building complex test sets. Within this dataset, the enrollment utterances remain unchanged. However, the testing utterances are subject to random substitution with re-vocoded data sourced from the CNSpoof dataset </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S2.SS1.p1.1.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib31" title="">31</a><span class="ltx_text" id="S2.SS1.p1.1.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S2.SS1.p1.1.4" style="font-size:90%;">. The new testing dataset is referred to as CNComplex.</span></p> </div> <div class="ltx_para" id="S2.SS1.p2"> <p class="ltx_p" id="S2.SS1.p2.1"><span class="ltx_text" id="S2.SS1.p2.1.1" style="font-size:90%;">In terms of establishing the ground truth, we adhere to the definition outlined in SASVC 2022 </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S2.SS1.p2.1.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib32" title="">32</a><span class="ltx_text" id="S2.SS1.p2.1.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S2.SS1.p2.1.4" style="font-size:90%;">. Specifically, if the testing utterance is genuine and its corresponding speaker label matches the enrolled identity, it is labeled as true; otherwise, it is marked as false. Consequently, the new testing dataset now encompasses both channel mismatch and spoofing attacks.</span></p> </div> </section> <section class="ltx_subsection" id="S2.SS2"> <h3 class="ltx_title ltx_title_subsection" style="font-size:90%;"> <span class="ltx_tag ltx_tag_subsection">2.2 </span>Preliminary experimental result of contemporary ASV model</h3> <div class="ltx_para" id="S2.SS2.p1"> <p class="ltx_p" id="S2.SS2.p1.1"><span class="ltx_text" id="S2.SS2.p1.1.1" style="font-size:90%;">To demonstrate the vulnerability of the contemporary ASV model when faced with the CNComplex testing dataset, an ECAPA-TDNN speaker encoder was trained, incorporating the AM-Softmax loss function </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S2.SS2.p1.1.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib33" title="">33</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib34" title="">34</a><span class="ltx_text" id="S2.SS2.p1.1.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S2.SS2.p1.1.4" style="font-size:90%;">, on two separate datasets: CNCeleb 1&amp;2 and the combination of CNCeleb 1&amp;2 and CNSpoof datasets according to the cross-genre protocols (CGPs) described in </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S2.SS2.p1.1.5.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib31" title="">31</a><span class="ltx_text" id="S2.SS2.p1.1.6.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S2.SS2.p1.1.7" style="font-size:90%;">. Each protocol excludes data with certain genres from the training dataset. In detail, the ten genres present in the training data were categorized into four groups, as outlined in Table </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S2.T1" style="font-size:90%;" title="Table 1 ‣ 2 Vulnerability of contemporary ASV model to simultaneous threats ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">1</span></a><span class="ltx_text" id="S2.SS2.p1.1.8" style="font-size:90%;">. It is worth noting that the ”advertisement” genre was excluded from the dataset due to its limited amount of data. For each CGP, </span><math alttext="660,000" class="ltx_Math" display="inline" id="S2.SS2.p1.1.m1.2"><semantics id="S2.SS2.p1.1.m1.2a"><mrow id="S2.SS2.p1.1.m1.2.3.2" xref="S2.SS2.p1.1.m1.2.3.1.cmml"><mn id="S2.SS2.p1.1.m1.1.1" mathsize="90%" xref="S2.SS2.p1.1.m1.1.1.cmml">660</mn><mo id="S2.SS2.p1.1.m1.2.3.2.1" mathsize="90%" xref="S2.SS2.p1.1.m1.2.3.1.cmml">,</mo><mn id="S2.SS2.p1.1.m1.2.2" mathsize="90%" xref="S2.SS2.p1.1.m1.2.2.cmml">000</mn></mrow><annotation-xml encoding="MathML-Content" id="S2.SS2.p1.1.m1.2b"><list id="S2.SS2.p1.1.m1.2.3.1.cmml" xref="S2.SS2.p1.1.m1.2.3.2"><cn id="S2.SS2.p1.1.m1.1.1.cmml" type="integer" xref="S2.SS2.p1.1.m1.1.1">660</cn><cn id="S2.SS2.p1.1.m1.2.2.cmml" type="integer" xref="S2.SS2.p1.1.m1.2.2">000</cn></list></annotation-xml><annotation encoding="application/x-tex" id="S2.SS2.p1.1.m1.2c">660,000</annotation><annotation encoding="application/x-llamapun" id="S2.SS2.p1.1.m1.2d">660 , 000</annotation></semantics></math><span class="ltx_text" id="S2.SS2.p1.1.9" style="font-size:90%;"> utterances were randomly sampled to compile a training dataset, which encompassed three of the genre groups. Importantly, in this way, each training dataset has an unseen genre group in the testing dataset. This strategy is detailed in Table </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S2.T2" style="font-size:90%;" title="Table 2 ‣ 2 Vulnerability of contemporary ASV model to simultaneous threats ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">2</span></a><span class="ltx_text" id="S2.SS2.p1.1.10" style="font-size:90%;">.</span></p> </div> <div class="ltx_para" id="S2.SS2.p2"> <p class="ltx_p" id="S2.SS2.p2.1"><span class="ltx_text" id="S2.SS2.p2.1.1" style="font-size:90%;">The results of this experiment evaluated by cosine similarity are presented in Table </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S2.T3" style="font-size:90%;" title="Table 3 ‣ 2.2 Preliminary experimental result of contemporary ASV model ‣ 2 Vulnerability of contemporary ASV model to simultaneous threats ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">3</span></a><span class="ltx_text" id="S2.SS2.p2.1.2" style="font-size:90%;">. We subjected these well-trained models to evaluation on both the original CNCeleb testing dataset (CNCeleb.Eval) and the newly introduced CNComplex dataset.</span></p> </div> <figure class="ltx_table" id="S2.T3"> <figcaption class="ltx_caption" style="font-size:80%;"><span class="ltx_tag ltx_tag_table"><span class="ltx_text ltx_font_bold" id="S2.T3.7.1.1">Table 3</span>: </span>Performance of the ECAPA-TDNN model trained on two distinct sets of data: CNCeleb 1&amp;2 and the combination of (CNCeleb 1&amp;2, CNSpoof) according to CGP. Subsequently, evaluations on both the original CNCeleb testing dataset and the newly developed testing dataset were conducted. The evaluation metrics employed for this assessment are the SV-EER and SASV-EER defined in <cite class="ltx_cite ltx_citemacro_cite">[<a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib27" title="">27</a>]</cite>.</figcaption> <table class="ltx_tabular ltx_centering ltx_guessed_headers ltx_align_middle" id="S2.T3.8"> <thead class="ltx_thead"> <tr class="ltx_tr" id="S2.T3.8.1.1"> <th class="ltx_td ltx_align_left ltx_th ltx_th_column ltx_th_row ltx_border_tt" id="S2.T3.8.1.1.1" rowspan="2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text ltx_font_bold" id="S2.T3.8.1.1.1.1" style="font-size:80%;">Protocol</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt" id="S2.T3.8.1.1.2" rowspan="2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text ltx_font_bold" id="S2.T3.8.1.1.2.1" style="font-size:80%;">Training dataset</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt" colspan="2" id="S2.T3.8.1.1.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text ltx_font_bold" id="S2.T3.8.1.1.3.1" style="font-size:80%;">CNCeleb.Eval</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt" colspan="2" id="S2.T3.8.1.1.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text ltx_font_bold" id="S2.T3.8.1.1.4.1" style="font-size:80%;">CNComplex</span></th> </tr> <tr class="ltx_tr" id="S2.T3.8.2.2"> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S2.T3.8.2.2.1" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.2.2.1.1" style="font-size:80%;">SV-EER</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S2.T3.8.2.2.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.2.2.2.1" style="font-size:80%;">SASV-EER</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S2.T3.8.2.2.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.2.2.3.1" style="font-size:80%;">SV-EER</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S2.T3.8.2.2.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.2.2.4.1" style="font-size:80%;">SASV-EER</span></th> </tr> </thead> <tbody class="ltx_tbody"> <tr class="ltx_tr" id="S2.T3.8.3.1"> <th class="ltx_td ltx_align_left ltx_th ltx_th_row ltx_border_t" id="S2.T3.8.3.1.1" rowspan="2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text ltx_font_bold" id="S2.T3.8.3.1.1.1" style="font-size:80%;">CGP I</span></th> <td class="ltx_td ltx_align_center ltx_border_t" id="S2.T3.8.3.1.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.3.1.2.1" style="font-size:80%;">CNCeleb 1&amp;2</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S2.T3.8.3.1.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.3.1.3.1" style="font-size:80%;">9.38</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S2.T3.8.3.1.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.3.1.4.1" style="font-size:80%;">-</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S2.T3.8.3.1.5" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.3.1.5.1" style="font-size:80%;">10.05</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S2.T3.8.3.1.6" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.3.1.6.1" style="font-size:80%;">37.64</span></td> </tr> <tr class="ltx_tr" id="S2.T3.8.4.2"> <td class="ltx_td ltx_align_center" id="S2.T3.8.4.2.1" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.4.2.1.1" style="font-size:80%;">Combination</span></td> <td class="ltx_td ltx_align_center" id="S2.T3.8.4.2.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.4.2.2.1" style="font-size:80%;">9.02</span></td> <td class="ltx_td ltx_align_center" id="S2.T3.8.4.2.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.4.2.3.1" style="font-size:80%;">-</span></td> <td class="ltx_td ltx_align_center" id="S2.T3.8.4.2.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.4.2.4.1" style="font-size:80%;">9.56</span></td> <td class="ltx_td ltx_align_center" id="S2.T3.8.4.2.5" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.4.2.5.1" style="font-size:80%;">36.97</span></td> </tr> <tr class="ltx_tr" id="S2.T3.8.5.3"> <th class="ltx_td ltx_align_left ltx_th ltx_th_row ltx_border_tt" id="S2.T3.8.5.3.1" rowspan="2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text ltx_font_bold" id="S2.T3.8.5.3.1.1" style="font-size:80%;">CGP II</span></th> <td class="ltx_td ltx_align_center ltx_border_tt" id="S2.T3.8.5.3.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.5.3.2.1" style="font-size:80%;">CNCeleb 1&amp;2</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S2.T3.8.5.3.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.5.3.3.1" style="font-size:80%;">10.04</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S2.T3.8.5.3.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.5.3.4.1" style="font-size:80%;">-</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S2.T3.8.5.3.5" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.5.3.5.1" style="font-size:80%;">10.85</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S2.T3.8.5.3.6" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.5.3.6.1" style="font-size:80%;">40.76</span></td> </tr> <tr class="ltx_tr" id="S2.T3.8.6.4"> <td class="ltx_td ltx_align_center" id="S2.T3.8.6.4.1" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.6.4.1.1" style="font-size:80%;">Combination</span></td> <td class="ltx_td ltx_align_center" id="S2.T3.8.6.4.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.6.4.2.1" style="font-size:80%;">9.75</span></td> <td class="ltx_td ltx_align_center" id="S2.T3.8.6.4.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.6.4.3.1" style="font-size:80%;">-</span></td> <td class="ltx_td ltx_align_center" id="S2.T3.8.6.4.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.6.4.4.1" style="font-size:80%;">10.79</span></td> <td class="ltx_td ltx_align_center" id="S2.T3.8.6.4.5" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.6.4.5.1" style="font-size:80%;">40.17</span></td> </tr> <tr class="ltx_tr" id="S2.T3.8.7.5"> <th class="ltx_td ltx_align_left ltx_th ltx_th_row ltx_border_tt" id="S2.T3.8.7.5.1" rowspan="2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text ltx_font_bold" id="S2.T3.8.7.5.1.1" style="font-size:80%;">CGP III</span></th> <td class="ltx_td ltx_align_center ltx_border_tt" id="S2.T3.8.7.5.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.7.5.2.1" style="font-size:80%;">CNCeleb 1&amp;2</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S2.T3.8.7.5.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.7.5.3.1" style="font-size:80%;">10.12</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S2.T3.8.7.5.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.7.5.4.1" style="font-size:80%;">-</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S2.T3.8.7.5.5" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.7.5.5.1" style="font-size:80%;">10.67</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S2.T3.8.7.5.6" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.7.5.6.1" style="font-size:80%;">39.62</span></td> </tr> <tr class="ltx_tr" id="S2.T3.8.8.6"> <td class="ltx_td ltx_align_center" id="S2.T3.8.8.6.1" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.8.6.1.1" style="font-size:80%;">Combination</span></td> <td class="ltx_td ltx_align_center" id="S2.T3.8.8.6.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.8.6.2.1" style="font-size:80%;">9.33</span></td> <td class="ltx_td ltx_align_center" id="S2.T3.8.8.6.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.8.6.3.1" style="font-size:80%;">-</span></td> <td class="ltx_td ltx_align_center" id="S2.T3.8.8.6.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.8.6.4.1" style="font-size:80%;">10.22</span></td> <td class="ltx_td ltx_align_center" id="S2.T3.8.8.6.5" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.8.6.5.1" style="font-size:80%;">38.49</span></td> </tr> <tr class="ltx_tr" id="S2.T3.8.9.7"> <th class="ltx_td ltx_align_left ltx_th ltx_th_row ltx_border_bb ltx_border_tt" id="S2.T3.8.9.7.1" rowspan="2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text ltx_font_bold" id="S2.T3.8.9.7.1.1" style="font-size:80%;">CGP IV</span></th> <td class="ltx_td ltx_align_center ltx_border_tt" id="S2.T3.8.9.7.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.9.7.2.1" style="font-size:80%;">CNCeleb 1&amp;2</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S2.T3.8.9.7.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.9.7.3.1" style="font-size:80%;">9.59</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S2.T3.8.9.7.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.9.7.4.1" style="font-size:80%;">-</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S2.T3.8.9.7.5" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.9.7.5.1" style="font-size:80%;">10.24</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S2.T3.8.9.7.6" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.9.7.6.1" style="font-size:80%;">37.97</span></td> </tr> <tr class="ltx_tr" id="S2.T3.8.10.8"> <td class="ltx_td ltx_align_center ltx_border_bb" id="S2.T3.8.10.8.1" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.10.8.1.1" style="font-size:80%;">Combination</span></td> <td class="ltx_td ltx_align_center ltx_border_bb" id="S2.T3.8.10.8.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.10.8.2.1" style="font-size:80%;">9.21</span></td> <td class="ltx_td ltx_align_center ltx_border_bb" id="S2.T3.8.10.8.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.10.8.3.1" style="font-size:80%;">-</span></td> <td class="ltx_td ltx_align_center ltx_border_bb" id="S2.T3.8.10.8.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.10.8.4.1" style="font-size:80%;">9.86</span></td> <td class="ltx_td ltx_align_center ltx_border_bb" id="S2.T3.8.10.8.5" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S2.T3.8.10.8.5.1" style="font-size:80%;">37.42</span></td> </tr> </tbody> </table> </figure> <div class="ltx_para" id="S2.SS2.p3"> <p class="ltx_p" id="S2.SS2.p3.1"><span class="ltx_text" id="S2.SS2.p3.1.1" style="font-size:90%;">First and foremost, it is noteworthy that the ECAPA-TDNN model trained on the combination of CNCeleb 1&amp;2 and CNSpoof datasets exhibited a slightly consistent performance improvement on the CNCeleb.Eval dataset compared to that of the model trained solely on the CNCeleb 1&amp;2 dataset under all CGPs. This observation suggests that the copy-synthesis method </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S2.SS2.p3.1.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib35" title="">35</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib36" title="">36</a><span class="ltx_text" id="S2.SS2.p3.1.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S2.SS2.p3.1.4" style="font-size:90%;"> employed to generate the CNSpoof dataset can be considered as an effective data augmentation strategy in training an ASV model.</span></p> </div> <div class="ltx_para" id="S2.SS2.p4"> <p class="ltx_p" id="S2.SS2.p4.1"><span class="ltx_text" id="S2.SS2.p4.1.1" style="font-size:90%;">However, the critical focus shifts to the results obtained from the CNComplex dataset. As anticipated, the performance substantially deteriorated on this new dataset under all CGPs, regardless of which training dataset was used. This unequivocally indicates that the contemporary state-of-the-art ASV model is ill-equipped to address the complexities inherent in the new testing dataset, which contains simultaneous threats.</span></p> </div> </section> </section> <section class="ltx_section" id="S3"> <h2 class="ltx_title ltx_title_section" style="font-size:90%;"> <span class="ltx_tag ltx_tag_section">3 </span>Proposed integrated approach</h2> <section class="ltx_subsection" id="S3.SS1"> <h3 class="ltx_title ltx_title_subsection" style="font-size:90%;"> <span class="ltx_tag ltx_tag_subsection">3.1 </span>Overview</h3> <div class="ltx_para" id="S3.SS1.p1"> <p class="ltx_p" id="S3.SS1.p1.1"><span class="ltx_text" id="S3.SS1.p1.1.1" style="font-size:90%;">As described in the preceding section, ASV models were observed to exhibit vulnerabilities when confronted with complex scenarios. However, in real-world settings, ASV systems often encounter these threats simultaneously. Therefore, it is imperative to develop a robust system that can effectively counter these challenges in a concurrent manner.</span></p> </div> <div class="ltx_para" id="S3.SS1.p2"> <p class="ltx_p" id="S3.SS1.p2.1"><span class="ltx_text" id="S3.SS1.p2.1.1" style="font-size:90%;">In this section, building upon the foundation laid by the previous studies </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S3.SS1.p2.1.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib11" title="">11</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib27" title="">27</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib28" title="">28</a><span class="ltx_text" id="S3.SS1.p2.1.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S3.SS1.p2.1.4" style="font-size:90%;">, we present an integrated solution designed to address these challenges comprehensively. The proposed solution utilizes an asymmetric dual-path model equipped with a multi-task learning strategy, as depicted in Figure </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.F2" style="font-size:90%;" title="Figure 2 ‣ 3.1 Overview ‣ 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">2</span></a><span class="ltx_text" id="S3.SS1.p2.1.5" style="font-size:90%;">. This model is tasked with simultaneously handling the ASV task, the anti-spoofing task, and the spoofing-aware ASV task. Furthermore, we incorporate pair-wise learning and spoofing attack simulation into the meta-learning paradigms to train the model so that the model is equipped with the capability to address channel mismatch, spoofing attacks, and domain mismatch issues.</span></p> </div> <figure class="ltx_figure" id="S3.F2"><img alt="Refer to caption" class="ltx_graphics ltx_centering ltx_img_landscape" height="332" id="S3.F2.g1" src="x2.png" width="436"/> <figcaption class="ltx_caption ltx_centering" style="font-size:90%;"><span class="ltx_tag ltx_tag_figure"><span class="ltx_text ltx_font_bold" id="S3.F2.4.1.1">Fig. 2</span>: </span>Architecture of the dual-path model consisting of four submodules: a generic feature extractor, a speaker classifier, a spoof classifier, and a SASV binary classifier. Notably, the feature extractor adopts an asymmetric dual-path structure.</figcaption> </figure> <div class="ltx_para" id="S3.SS1.p3"> <p class="ltx_p" id="S3.SS1.p3.1"><span class="ltx_text" id="S3.SS1.p3.1.1" style="font-size:90%;">In comparison to traditional robust systems targeting a single threat, the proposed approach presents two distinct advantages. Firstly, it capitalizes on the interactions between the spoofing-aware attention back-end and the meta-learning-based generalization, resulting in the enhancement of both subsystems. Secondly, the end-to-end training methodology enables more efficient signal propagation to train a system robust against multiple risks. The initial gains observed in individual tasks in the previous studies </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S3.SS1.p3.1.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib11" title="">11</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib27" title="">27</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib28" title="">28</a><span class="ltx_text" id="S3.SS1.p3.1.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S3.SS1.p3.1.4" style="font-size:90%;"> clearly underscore the promising potential of this integrated approach.</span></p> </div> </section> <section class="ltx_subsection" id="S3.SS2"> <h3 class="ltx_title ltx_title_subsection" style="font-size:90%;"> <span class="ltx_tag ltx_tag_subsection">3.2 </span>Model architecture and objectives</h3> <div class="ltx_para" id="S3.SS2.p1"> <p class="ltx_p" id="S3.SS2.p1.1"><span class="ltx_text" id="S3.SS2.p1.1.1" style="font-size:90%;">Figure </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.F2" style="font-size:90%;" title="Figure 2 ‣ 3.1 Overview ‣ 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">2</span></a><span class="ltx_text" id="S3.SS2.p1.1.2" style="font-size:90%;"> provides a visual representation of the architecture of the proposed spoofing-aware ASV model. This model consists of four main components:</span></p> <ul class="ltx_itemize" id="S3.I1"> <li class="ltx_item" id="S3.I1.i1" style="list-style-type:none;"> <span class="ltx_tag ltx_tag_item">•</span> <div class="ltx_para" id="S3.I1.i1.p1"> <p class="ltx_p" id="S3.I1.i1.p1.1"><span class="ltx_text" id="S3.I1.i1.p1.1.1" style="font-size:90%;">Asymmetric dual-path feature extractor</span></p> </div> </li> <li class="ltx_item" id="S3.I1.i2" style="list-style-type:none;"> <span class="ltx_tag ltx_tag_item">•</span> <div class="ltx_para" id="S3.I1.i2.p1"> <p class="ltx_p" id="S3.I1.i2.p1.1"><span class="ltx_text" id="S3.I1.i2.p1.1.1" style="font-size:90%;">Speaker classifier</span></p> </div> </li> <li class="ltx_item" id="S3.I1.i3" style="list-style-type:none;"> <span class="ltx_tag ltx_tag_item">•</span> <div class="ltx_para" id="S3.I1.i3.p1"> <p class="ltx_p" id="S3.I1.i3.p1.1"><span class="ltx_text" id="S3.I1.i3.p1.1.1" style="font-size:90%;">Spoof classifier</span></p> </div> </li> <li class="ltx_item" id="S3.I1.i4" style="list-style-type:none;"> <span class="ltx_tag ltx_tag_item">•</span> <div class="ltx_para" id="S3.I1.i4.p1"> <p class="ltx_p" id="S3.I1.i4.p1.1"><span class="ltx_text" id="S3.I1.i4.p1.1.1" style="font-size:90%;">SASV binary classifier</span></p> </div> </li> </ul> <p class="ltx_p" id="S3.SS2.p1.2"><span class="ltx_text" id="S3.SS2.p1.2.1" style="font-size:90%;">Each of these components will be elaborated upon in detail in the subsequent section.</span></p> </div> <section class="ltx_subsubsection" id="S3.SS2.SSS1"> <h4 class="ltx_title ltx_title_subsubsection" style="font-size:90%;"> <span class="ltx_tag ltx_tag_subsubsection">3.2.1 </span>Asymmetric dual-path feature extractor</h4> <div class="ltx_para" id="S3.SS2.SSS1.p1"> <p class="ltx_p" id="S3.SS2.SSS1.p1.1"><span class="ltx_text" id="S3.SS2.SSS1.p1.1.1" style="font-size:90%;">As depicted in Figure </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.F2" style="font-size:90%;" title="Figure 2 ‣ 3.1 Overview ‣ 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">2</span></a><span class="ltx_text" id="S3.SS2.SSS1.p1.1.2" style="font-size:90%;">, the asymmetric dual-path feature extractor comprises N blocks, each with distinct parameters but an identical architecture. Within each block, two different branches share the same input, denoted as </span><math alttext="\boldsymbol{X}" class="ltx_Math" display="inline" id="S3.SS2.SSS1.p1.1.m1.1"><semantics id="S3.SS2.SSS1.p1.1.m1.1a"><mi id="S3.SS2.SSS1.p1.1.m1.1.1" mathsize="90%" xref="S3.SS2.SSS1.p1.1.m1.1.1.cmml">𝑿</mi><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS1.p1.1.m1.1b"><ci id="S3.SS2.SSS1.p1.1.m1.1.1.cmml" xref="S3.SS2.SSS1.p1.1.m1.1.1">𝑿</ci></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS1.p1.1.m1.1c">\boldsymbol{X}</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS1.p1.1.m1.1d">bold_italic_X</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS1.p1.1.3" style="font-size:90%;">. The left multi-head attention branch, inspired by </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S3.SS2.SSS1.p1.1.4.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib37" title="">37</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib38" title="">38</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib39" title="">39</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib40" title="">40</a><span class="ltx_text" id="S3.SS2.SSS1.p1.1.5.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S3.SS2.SSS1.p1.1.6" style="font-size:90%;">, primarily focuses on extracting features for the anti-spoofing task. Conversely, the right SE-Res2Block, proposed in </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S3.SS2.SSS1.p1.1.7.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib7" title="">7</a><span class="ltx_text" id="S3.SS2.SSS1.p1.1.8.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S3.SS2.SSS1.p1.1.9" style="font-size:90%;">, is primarily responsible for feature extraction for the ASV task. The outputs of these two branches are combined through addition and layer normalization operations. The computational process can be expressed as follows:</span></p> <table class="ltx_equationgroup ltx_eqn_align ltx_eqn_table" id="S6.EGx1"> <tbody id="S3.E1"><tr class="ltx_equation ltx_eqn_row ltx_align_baseline"> <td class="ltx_eqn_cell ltx_eqn_center_padleft"></td> <td class="ltx_td ltx_align_right ltx_eqn_cell"><math alttext="\displaystyle\boldsymbol{H}_{left}^{n}" class="ltx_Math" display="inline" id="S3.E1.m1.1"><semantics id="S3.E1.m1.1a"><msubsup id="S3.E1.m1.1.1" xref="S3.E1.m1.1.1.cmml"><mi id="S3.E1.m1.1.1.2.2" mathsize="90%" xref="S3.E1.m1.1.1.2.2.cmml">𝑯</mi><mrow id="S3.E1.m1.1.1.2.3" xref="S3.E1.m1.1.1.2.3.cmml"><mi id="S3.E1.m1.1.1.2.3.2" mathsize="90%" xref="S3.E1.m1.1.1.2.3.2.cmml">l</mi><mo id="S3.E1.m1.1.1.2.3.1" xref="S3.E1.m1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.E1.m1.1.1.2.3.3" mathsize="90%" xref="S3.E1.m1.1.1.2.3.3.cmml">e</mi><mo id="S3.E1.m1.1.1.2.3.1a" xref="S3.E1.m1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.E1.m1.1.1.2.3.4" mathsize="90%" xref="S3.E1.m1.1.1.2.3.4.cmml">f</mi><mo id="S3.E1.m1.1.1.2.3.1b" xref="S3.E1.m1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.E1.m1.1.1.2.3.5" mathsize="90%" xref="S3.E1.m1.1.1.2.3.5.cmml">t</mi></mrow><mi id="S3.E1.m1.1.1.3" mathsize="90%" xref="S3.E1.m1.1.1.3.cmml">n</mi></msubsup><annotation-xml encoding="MathML-Content" id="S3.E1.m1.1b"><apply id="S3.E1.m1.1.1.cmml" xref="S3.E1.m1.1.1"><csymbol cd="ambiguous" id="S3.E1.m1.1.1.1.cmml" xref="S3.E1.m1.1.1">superscript</csymbol><apply id="S3.E1.m1.1.1.2.cmml" xref="S3.E1.m1.1.1"><csymbol cd="ambiguous" id="S3.E1.m1.1.1.2.1.cmml" xref="S3.E1.m1.1.1">subscript</csymbol><ci id="S3.E1.m1.1.1.2.2.cmml" xref="S3.E1.m1.1.1.2.2">𝑯</ci><apply id="S3.E1.m1.1.1.2.3.cmml" xref="S3.E1.m1.1.1.2.3"><times id="S3.E1.m1.1.1.2.3.1.cmml" xref="S3.E1.m1.1.1.2.3.1"></times><ci id="S3.E1.m1.1.1.2.3.2.cmml" xref="S3.E1.m1.1.1.2.3.2">𝑙</ci><ci id="S3.E1.m1.1.1.2.3.3.cmml" xref="S3.E1.m1.1.1.2.3.3">𝑒</ci><ci id="S3.E1.m1.1.1.2.3.4.cmml" xref="S3.E1.m1.1.1.2.3.4">𝑓</ci><ci id="S3.E1.m1.1.1.2.3.5.cmml" xref="S3.E1.m1.1.1.2.3.5">𝑡</ci></apply></apply><ci id="S3.E1.m1.1.1.3.cmml" xref="S3.E1.m1.1.1.3">𝑛</ci></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E1.m1.1c">\displaystyle\boldsymbol{H}_{left}^{n}</annotation><annotation encoding="application/x-llamapun" id="S3.E1.m1.1d">bold_italic_H start_POSTSUBSCRIPT italic_l italic_e italic_f italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT</annotation></semantics></math></td> <td class="ltx_td ltx_align_left ltx_eqn_cell"><math alttext="\displaystyle=f_{left}(\boldsymbol{X})," class="ltx_Math" display="inline" id="S3.E1.m2.2"><semantics id="S3.E1.m2.2a"><mrow id="S3.E1.m2.2.2.1" xref="S3.E1.m2.2.2.1.1.cmml"><mrow id="S3.E1.m2.2.2.1.1" xref="S3.E1.m2.2.2.1.1.cmml"><mi id="S3.E1.m2.2.2.1.1.2" xref="S3.E1.m2.2.2.1.1.2.cmml"></mi><mo id="S3.E1.m2.2.2.1.1.1" mathsize="90%" xref="S3.E1.m2.2.2.1.1.1.cmml">=</mo><mrow id="S3.E1.m2.2.2.1.1.3" xref="S3.E1.m2.2.2.1.1.3.cmml"><msub id="S3.E1.m2.2.2.1.1.3.2" xref="S3.E1.m2.2.2.1.1.3.2.cmml"><mi id="S3.E1.m2.2.2.1.1.3.2.2" mathsize="90%" xref="S3.E1.m2.2.2.1.1.3.2.2.cmml">f</mi><mrow id="S3.E1.m2.2.2.1.1.3.2.3" xref="S3.E1.m2.2.2.1.1.3.2.3.cmml"><mi id="S3.E1.m2.2.2.1.1.3.2.3.2" mathsize="90%" xref="S3.E1.m2.2.2.1.1.3.2.3.2.cmml">l</mi><mo id="S3.E1.m2.2.2.1.1.3.2.3.1" xref="S3.E1.m2.2.2.1.1.3.2.3.1.cmml">⁢</mo><mi id="S3.E1.m2.2.2.1.1.3.2.3.3" mathsize="90%" xref="S3.E1.m2.2.2.1.1.3.2.3.3.cmml">e</mi><mo id="S3.E1.m2.2.2.1.1.3.2.3.1a" xref="S3.E1.m2.2.2.1.1.3.2.3.1.cmml">⁢</mo><mi id="S3.E1.m2.2.2.1.1.3.2.3.4" mathsize="90%" xref="S3.E1.m2.2.2.1.1.3.2.3.4.cmml">f</mi><mo id="S3.E1.m2.2.2.1.1.3.2.3.1b" xref="S3.E1.m2.2.2.1.1.3.2.3.1.cmml">⁢</mo><mi id="S3.E1.m2.2.2.1.1.3.2.3.5" mathsize="90%" xref="S3.E1.m2.2.2.1.1.3.2.3.5.cmml">t</mi></mrow></msub><mo id="S3.E1.m2.2.2.1.1.3.1" xref="S3.E1.m2.2.2.1.1.3.1.cmml">⁢</mo><mrow id="S3.E1.m2.2.2.1.1.3.3.2" xref="S3.E1.m2.2.2.1.1.3.cmml"><mo id="S3.E1.m2.2.2.1.1.3.3.2.1" maxsize="90%" minsize="90%" xref="S3.E1.m2.2.2.1.1.3.cmml">(</mo><mi id="S3.E1.m2.1.1" mathsize="90%" xref="S3.E1.m2.1.1.cmml">𝑿</mi><mo id="S3.E1.m2.2.2.1.1.3.3.2.2" maxsize="90%" minsize="90%" xref="S3.E1.m2.2.2.1.1.3.cmml">)</mo></mrow></mrow></mrow><mo id="S3.E1.m2.2.2.1.2" mathsize="90%" xref="S3.E1.m2.2.2.1.1.cmml">,</mo></mrow><annotation-xml encoding="MathML-Content" id="S3.E1.m2.2b"><apply id="S3.E1.m2.2.2.1.1.cmml" xref="S3.E1.m2.2.2.1"><eq id="S3.E1.m2.2.2.1.1.1.cmml" xref="S3.E1.m2.2.2.1.1.1"></eq><csymbol cd="latexml" id="S3.E1.m2.2.2.1.1.2.cmml" xref="S3.E1.m2.2.2.1.1.2">absent</csymbol><apply id="S3.E1.m2.2.2.1.1.3.cmml" xref="S3.E1.m2.2.2.1.1.3"><times id="S3.E1.m2.2.2.1.1.3.1.cmml" xref="S3.E1.m2.2.2.1.1.3.1"></times><apply id="S3.E1.m2.2.2.1.1.3.2.cmml" xref="S3.E1.m2.2.2.1.1.3.2"><csymbol cd="ambiguous" id="S3.E1.m2.2.2.1.1.3.2.1.cmml" xref="S3.E1.m2.2.2.1.1.3.2">subscript</csymbol><ci id="S3.E1.m2.2.2.1.1.3.2.2.cmml" xref="S3.E1.m2.2.2.1.1.3.2.2">𝑓</ci><apply id="S3.E1.m2.2.2.1.1.3.2.3.cmml" xref="S3.E1.m2.2.2.1.1.3.2.3"><times id="S3.E1.m2.2.2.1.1.3.2.3.1.cmml" xref="S3.E1.m2.2.2.1.1.3.2.3.1"></times><ci id="S3.E1.m2.2.2.1.1.3.2.3.2.cmml" xref="S3.E1.m2.2.2.1.1.3.2.3.2">𝑙</ci><ci id="S3.E1.m2.2.2.1.1.3.2.3.3.cmml" xref="S3.E1.m2.2.2.1.1.3.2.3.3">𝑒</ci><ci id="S3.E1.m2.2.2.1.1.3.2.3.4.cmml" xref="S3.E1.m2.2.2.1.1.3.2.3.4">𝑓</ci><ci id="S3.E1.m2.2.2.1.1.3.2.3.5.cmml" xref="S3.E1.m2.2.2.1.1.3.2.3.5">𝑡</ci></apply></apply><ci id="S3.E1.m2.1.1.cmml" xref="S3.E1.m2.1.1">𝑿</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E1.m2.2c">\displaystyle=f_{left}(\boldsymbol{X}),</annotation><annotation encoding="application/x-llamapun" id="S3.E1.m2.2d">= italic_f start_POSTSUBSCRIPT italic_l italic_e italic_f italic_t end_POSTSUBSCRIPT ( bold_italic_X ) ,</annotation></semantics></math></td> <td class="ltx_eqn_cell ltx_eqn_center_padright"></td> <td class="ltx_eqn_cell ltx_eqn_eqno ltx_align_middle ltx_align_right" rowspan="1"><span class="ltx_tag ltx_tag_equation ltx_align_right">(1)</span></td> </tr></tbody> <tbody id="S3.E2"><tr class="ltx_equation ltx_eqn_row ltx_align_baseline"> <td class="ltx_eqn_cell ltx_eqn_center_padleft"></td> <td class="ltx_td ltx_align_right ltx_eqn_cell"><math alttext="\displaystyle\boldsymbol{H}_{right}^{n}" class="ltx_Math" display="inline" id="S3.E2.m1.1"><semantics id="S3.E2.m1.1a"><msubsup id="S3.E2.m1.1.1" xref="S3.E2.m1.1.1.cmml"><mi id="S3.E2.m1.1.1.2.2" mathsize="90%" xref="S3.E2.m1.1.1.2.2.cmml">𝑯</mi><mrow id="S3.E2.m1.1.1.2.3" xref="S3.E2.m1.1.1.2.3.cmml"><mi id="S3.E2.m1.1.1.2.3.2" mathsize="90%" xref="S3.E2.m1.1.1.2.3.2.cmml">r</mi><mo id="S3.E2.m1.1.1.2.3.1" xref="S3.E2.m1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.E2.m1.1.1.2.3.3" mathsize="90%" xref="S3.E2.m1.1.1.2.3.3.cmml">i</mi><mo id="S3.E2.m1.1.1.2.3.1a" xref="S3.E2.m1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.E2.m1.1.1.2.3.4" mathsize="90%" xref="S3.E2.m1.1.1.2.3.4.cmml">g</mi><mo id="S3.E2.m1.1.1.2.3.1b" xref="S3.E2.m1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.E2.m1.1.1.2.3.5" mathsize="90%" xref="S3.E2.m1.1.1.2.3.5.cmml">h</mi><mo id="S3.E2.m1.1.1.2.3.1c" xref="S3.E2.m1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.E2.m1.1.1.2.3.6" mathsize="90%" xref="S3.E2.m1.1.1.2.3.6.cmml">t</mi></mrow><mi id="S3.E2.m1.1.1.3" mathsize="90%" xref="S3.E2.m1.1.1.3.cmml">n</mi></msubsup><annotation-xml encoding="MathML-Content" id="S3.E2.m1.1b"><apply id="S3.E2.m1.1.1.cmml" xref="S3.E2.m1.1.1"><csymbol cd="ambiguous" id="S3.E2.m1.1.1.1.cmml" xref="S3.E2.m1.1.1">superscript</csymbol><apply id="S3.E2.m1.1.1.2.cmml" xref="S3.E2.m1.1.1"><csymbol cd="ambiguous" id="S3.E2.m1.1.1.2.1.cmml" xref="S3.E2.m1.1.1">subscript</csymbol><ci id="S3.E2.m1.1.1.2.2.cmml" xref="S3.E2.m1.1.1.2.2">𝑯</ci><apply id="S3.E2.m1.1.1.2.3.cmml" xref="S3.E2.m1.1.1.2.3"><times id="S3.E2.m1.1.1.2.3.1.cmml" xref="S3.E2.m1.1.1.2.3.1"></times><ci id="S3.E2.m1.1.1.2.3.2.cmml" xref="S3.E2.m1.1.1.2.3.2">𝑟</ci><ci id="S3.E2.m1.1.1.2.3.3.cmml" xref="S3.E2.m1.1.1.2.3.3">𝑖</ci><ci id="S3.E2.m1.1.1.2.3.4.cmml" xref="S3.E2.m1.1.1.2.3.4">𝑔</ci><ci id="S3.E2.m1.1.1.2.3.5.cmml" xref="S3.E2.m1.1.1.2.3.5">ℎ</ci><ci id="S3.E2.m1.1.1.2.3.6.cmml" xref="S3.E2.m1.1.1.2.3.6">𝑡</ci></apply></apply><ci id="S3.E2.m1.1.1.3.cmml" xref="S3.E2.m1.1.1.3">𝑛</ci></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E2.m1.1c">\displaystyle\boldsymbol{H}_{right}^{n}</annotation><annotation encoding="application/x-llamapun" id="S3.E2.m1.1d">bold_italic_H start_POSTSUBSCRIPT italic_r italic_i italic_g italic_h italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT</annotation></semantics></math></td> <td class="ltx_td ltx_align_left ltx_eqn_cell"><math alttext="\displaystyle=f_{right}(\boldsymbol{X})," class="ltx_Math" display="inline" id="S3.E2.m2.2"><semantics id="S3.E2.m2.2a"><mrow id="S3.E2.m2.2.2.1" xref="S3.E2.m2.2.2.1.1.cmml"><mrow id="S3.E2.m2.2.2.1.1" xref="S3.E2.m2.2.2.1.1.cmml"><mi id="S3.E2.m2.2.2.1.1.2" xref="S3.E2.m2.2.2.1.1.2.cmml"></mi><mo id="S3.E2.m2.2.2.1.1.1" mathsize="90%" xref="S3.E2.m2.2.2.1.1.1.cmml">=</mo><mrow id="S3.E2.m2.2.2.1.1.3" xref="S3.E2.m2.2.2.1.1.3.cmml"><msub id="S3.E2.m2.2.2.1.1.3.2" xref="S3.E2.m2.2.2.1.1.3.2.cmml"><mi id="S3.E2.m2.2.2.1.1.3.2.2" mathsize="90%" xref="S3.E2.m2.2.2.1.1.3.2.2.cmml">f</mi><mrow id="S3.E2.m2.2.2.1.1.3.2.3" xref="S3.E2.m2.2.2.1.1.3.2.3.cmml"><mi id="S3.E2.m2.2.2.1.1.3.2.3.2" mathsize="90%" xref="S3.E2.m2.2.2.1.1.3.2.3.2.cmml">r</mi><mo id="S3.E2.m2.2.2.1.1.3.2.3.1" xref="S3.E2.m2.2.2.1.1.3.2.3.1.cmml">⁢</mo><mi id="S3.E2.m2.2.2.1.1.3.2.3.3" mathsize="90%" xref="S3.E2.m2.2.2.1.1.3.2.3.3.cmml">i</mi><mo id="S3.E2.m2.2.2.1.1.3.2.3.1a" xref="S3.E2.m2.2.2.1.1.3.2.3.1.cmml">⁢</mo><mi id="S3.E2.m2.2.2.1.1.3.2.3.4" mathsize="90%" xref="S3.E2.m2.2.2.1.1.3.2.3.4.cmml">g</mi><mo id="S3.E2.m2.2.2.1.1.3.2.3.1b" xref="S3.E2.m2.2.2.1.1.3.2.3.1.cmml">⁢</mo><mi id="S3.E2.m2.2.2.1.1.3.2.3.5" mathsize="90%" xref="S3.E2.m2.2.2.1.1.3.2.3.5.cmml">h</mi><mo id="S3.E2.m2.2.2.1.1.3.2.3.1c" xref="S3.E2.m2.2.2.1.1.3.2.3.1.cmml">⁢</mo><mi id="S3.E2.m2.2.2.1.1.3.2.3.6" mathsize="90%" xref="S3.E2.m2.2.2.1.1.3.2.3.6.cmml">t</mi></mrow></msub><mo id="S3.E2.m2.2.2.1.1.3.1" xref="S3.E2.m2.2.2.1.1.3.1.cmml">⁢</mo><mrow id="S3.E2.m2.2.2.1.1.3.3.2" xref="S3.E2.m2.2.2.1.1.3.cmml"><mo id="S3.E2.m2.2.2.1.1.3.3.2.1" maxsize="90%" minsize="90%" xref="S3.E2.m2.2.2.1.1.3.cmml">(</mo><mi id="S3.E2.m2.1.1" mathsize="90%" xref="S3.E2.m2.1.1.cmml">𝑿</mi><mo id="S3.E2.m2.2.2.1.1.3.3.2.2" maxsize="90%" minsize="90%" xref="S3.E2.m2.2.2.1.1.3.cmml">)</mo></mrow></mrow></mrow><mo id="S3.E2.m2.2.2.1.2" mathsize="90%" xref="S3.E2.m2.2.2.1.1.cmml">,</mo></mrow><annotation-xml encoding="MathML-Content" id="S3.E2.m2.2b"><apply id="S3.E2.m2.2.2.1.1.cmml" xref="S3.E2.m2.2.2.1"><eq id="S3.E2.m2.2.2.1.1.1.cmml" xref="S3.E2.m2.2.2.1.1.1"></eq><csymbol cd="latexml" id="S3.E2.m2.2.2.1.1.2.cmml" xref="S3.E2.m2.2.2.1.1.2">absent</csymbol><apply id="S3.E2.m2.2.2.1.1.3.cmml" xref="S3.E2.m2.2.2.1.1.3"><times id="S3.E2.m2.2.2.1.1.3.1.cmml" xref="S3.E2.m2.2.2.1.1.3.1"></times><apply id="S3.E2.m2.2.2.1.1.3.2.cmml" xref="S3.E2.m2.2.2.1.1.3.2"><csymbol cd="ambiguous" id="S3.E2.m2.2.2.1.1.3.2.1.cmml" xref="S3.E2.m2.2.2.1.1.3.2">subscript</csymbol><ci id="S3.E2.m2.2.2.1.1.3.2.2.cmml" xref="S3.E2.m2.2.2.1.1.3.2.2">𝑓</ci><apply id="S3.E2.m2.2.2.1.1.3.2.3.cmml" xref="S3.E2.m2.2.2.1.1.3.2.3"><times id="S3.E2.m2.2.2.1.1.3.2.3.1.cmml" xref="S3.E2.m2.2.2.1.1.3.2.3.1"></times><ci id="S3.E2.m2.2.2.1.1.3.2.3.2.cmml" xref="S3.E2.m2.2.2.1.1.3.2.3.2">𝑟</ci><ci id="S3.E2.m2.2.2.1.1.3.2.3.3.cmml" xref="S3.E2.m2.2.2.1.1.3.2.3.3">𝑖</ci><ci id="S3.E2.m2.2.2.1.1.3.2.3.4.cmml" xref="S3.E2.m2.2.2.1.1.3.2.3.4">𝑔</ci><ci id="S3.E2.m2.2.2.1.1.3.2.3.5.cmml" xref="S3.E2.m2.2.2.1.1.3.2.3.5">ℎ</ci><ci id="S3.E2.m2.2.2.1.1.3.2.3.6.cmml" xref="S3.E2.m2.2.2.1.1.3.2.3.6">𝑡</ci></apply></apply><ci id="S3.E2.m2.1.1.cmml" xref="S3.E2.m2.1.1">𝑿</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E2.m2.2c">\displaystyle=f_{right}(\boldsymbol{X}),</annotation><annotation encoding="application/x-llamapun" id="S3.E2.m2.2d">= italic_f start_POSTSUBSCRIPT italic_r italic_i italic_g italic_h italic_t end_POSTSUBSCRIPT ( bold_italic_X ) ,</annotation></semantics></math></td> <td class="ltx_eqn_cell ltx_eqn_center_padright"></td> <td class="ltx_eqn_cell ltx_eqn_eqno ltx_align_middle ltx_align_right" rowspan="1"><span class="ltx_tag ltx_tag_equation ltx_align_right">(2)</span></td> </tr></tbody> <tbody id="S3.E3"><tr class="ltx_equation ltx_eqn_row ltx_align_baseline"> <td class="ltx_eqn_cell ltx_eqn_center_padleft"></td> <td class="ltx_td ltx_align_right ltx_eqn_cell"><math alttext="\displaystyle\boldsymbol{H}_{fuse}^{n}" class="ltx_Math" display="inline" id="S3.E3.m1.1"><semantics id="S3.E3.m1.1a"><msubsup id="S3.E3.m1.1.1" xref="S3.E3.m1.1.1.cmml"><mi id="S3.E3.m1.1.1.2.2" mathsize="90%" xref="S3.E3.m1.1.1.2.2.cmml">𝑯</mi><mrow id="S3.E3.m1.1.1.2.3" xref="S3.E3.m1.1.1.2.3.cmml"><mi id="S3.E3.m1.1.1.2.3.2" mathsize="90%" xref="S3.E3.m1.1.1.2.3.2.cmml">f</mi><mo id="S3.E3.m1.1.1.2.3.1" xref="S3.E3.m1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.E3.m1.1.1.2.3.3" mathsize="90%" xref="S3.E3.m1.1.1.2.3.3.cmml">u</mi><mo id="S3.E3.m1.1.1.2.3.1a" xref="S3.E3.m1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.E3.m1.1.1.2.3.4" mathsize="90%" xref="S3.E3.m1.1.1.2.3.4.cmml">s</mi><mo id="S3.E3.m1.1.1.2.3.1b" xref="S3.E3.m1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.E3.m1.1.1.2.3.5" mathsize="90%" xref="S3.E3.m1.1.1.2.3.5.cmml">e</mi></mrow><mi id="S3.E3.m1.1.1.3" mathsize="90%" xref="S3.E3.m1.1.1.3.cmml">n</mi></msubsup><annotation-xml encoding="MathML-Content" id="S3.E3.m1.1b"><apply id="S3.E3.m1.1.1.cmml" xref="S3.E3.m1.1.1"><csymbol cd="ambiguous" id="S3.E3.m1.1.1.1.cmml" xref="S3.E3.m1.1.1">superscript</csymbol><apply id="S3.E3.m1.1.1.2.cmml" xref="S3.E3.m1.1.1"><csymbol cd="ambiguous" id="S3.E3.m1.1.1.2.1.cmml" xref="S3.E3.m1.1.1">subscript</csymbol><ci id="S3.E3.m1.1.1.2.2.cmml" xref="S3.E3.m1.1.1.2.2">𝑯</ci><apply id="S3.E3.m1.1.1.2.3.cmml" xref="S3.E3.m1.1.1.2.3"><times id="S3.E3.m1.1.1.2.3.1.cmml" xref="S3.E3.m1.1.1.2.3.1"></times><ci id="S3.E3.m1.1.1.2.3.2.cmml" xref="S3.E3.m1.1.1.2.3.2">𝑓</ci><ci id="S3.E3.m1.1.1.2.3.3.cmml" xref="S3.E3.m1.1.1.2.3.3">𝑢</ci><ci id="S3.E3.m1.1.1.2.3.4.cmml" xref="S3.E3.m1.1.1.2.3.4">𝑠</ci><ci id="S3.E3.m1.1.1.2.3.5.cmml" xref="S3.E3.m1.1.1.2.3.5">𝑒</ci></apply></apply><ci id="S3.E3.m1.1.1.3.cmml" xref="S3.E3.m1.1.1.3">𝑛</ci></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E3.m1.1c">\displaystyle\boldsymbol{H}_{fuse}^{n}</annotation><annotation encoding="application/x-llamapun" id="S3.E3.m1.1d">bold_italic_H start_POSTSUBSCRIPT italic_f italic_u italic_s italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT</annotation></semantics></math></td> <td class="ltx_td ltx_align_left ltx_eqn_cell"><math alttext="\displaystyle=\text{LN}(\boldsymbol{H}_{spoof}^{n}+\boldsymbol{H}_{asv}^{n})," class="ltx_Math" display="inline" id="S3.E3.m2.1"><semantics id="S3.E3.m2.1a"><mrow id="S3.E3.m2.1.1.1" xref="S3.E3.m2.1.1.1.1.cmml"><mrow id="S3.E3.m2.1.1.1.1" xref="S3.E3.m2.1.1.1.1.cmml"><mi id="S3.E3.m2.1.1.1.1.3" xref="S3.E3.m2.1.1.1.1.3.cmml"></mi><mo id="S3.E3.m2.1.1.1.1.2" mathsize="90%" xref="S3.E3.m2.1.1.1.1.2.cmml">=</mo><mrow id="S3.E3.m2.1.1.1.1.1" xref="S3.E3.m2.1.1.1.1.1.cmml"><mtext id="S3.E3.m2.1.1.1.1.1.3" mathsize="90%" xref="S3.E3.m2.1.1.1.1.1.3a.cmml">LN</mtext><mo id="S3.E3.m2.1.1.1.1.1.2" xref="S3.E3.m2.1.1.1.1.1.2.cmml">⁢</mo><mrow id="S3.E3.m2.1.1.1.1.1.1.1" xref="S3.E3.m2.1.1.1.1.1.1.1.1.cmml"><mo id="S3.E3.m2.1.1.1.1.1.1.1.2" maxsize="90%" minsize="90%" xref="S3.E3.m2.1.1.1.1.1.1.1.1.cmml">(</mo><mrow id="S3.E3.m2.1.1.1.1.1.1.1.1" xref="S3.E3.m2.1.1.1.1.1.1.1.1.cmml"><msubsup id="S3.E3.m2.1.1.1.1.1.1.1.1.2" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.cmml"><mi id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.2" mathsize="90%" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.2.cmml">𝑯</mi><mrow id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.cmml"><mi id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.2" mathsize="90%" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.2.cmml">s</mi><mo id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.1" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.1.cmml">⁢</mo><mi id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.3" mathsize="90%" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.3.cmml">p</mi><mo id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.1a" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.1.cmml">⁢</mo><mi id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.4" mathsize="90%" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.4.cmml">o</mi><mo id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.1b" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.1.cmml">⁢</mo><mi id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.5" mathsize="90%" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.5.cmml">o</mi><mo id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.1c" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.1.cmml">⁢</mo><mi id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.6" mathsize="90%" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.6.cmml">f</mi></mrow><mi id="S3.E3.m2.1.1.1.1.1.1.1.1.2.3" mathsize="90%" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.3.cmml">n</mi></msubsup><mo id="S3.E3.m2.1.1.1.1.1.1.1.1.1" mathsize="90%" xref="S3.E3.m2.1.1.1.1.1.1.1.1.1.cmml">+</mo><msubsup id="S3.E3.m2.1.1.1.1.1.1.1.1.3" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3.cmml"><mi id="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.2" mathsize="90%" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.2.cmml">𝑯</mi><mrow id="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.cmml"><mi id="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.2" mathsize="90%" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.2.cmml">a</mi><mo id="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.1" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.1.cmml">⁢</mo><mi id="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.3" mathsize="90%" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.3.cmml">s</mi><mo id="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.1a" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.1.cmml">⁢</mo><mi id="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.4" mathsize="90%" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.4.cmml">v</mi></mrow><mi id="S3.E3.m2.1.1.1.1.1.1.1.1.3.3" mathsize="90%" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3.3.cmml">n</mi></msubsup></mrow><mo id="S3.E3.m2.1.1.1.1.1.1.1.3" maxsize="90%" minsize="90%" xref="S3.E3.m2.1.1.1.1.1.1.1.1.cmml">)</mo></mrow></mrow></mrow><mo id="S3.E3.m2.1.1.1.2" mathsize="90%" xref="S3.E3.m2.1.1.1.1.cmml">,</mo></mrow><annotation-xml encoding="MathML-Content" id="S3.E3.m2.1b"><apply id="S3.E3.m2.1.1.1.1.cmml" xref="S3.E3.m2.1.1.1"><eq id="S3.E3.m2.1.1.1.1.2.cmml" xref="S3.E3.m2.1.1.1.1.2"></eq><csymbol cd="latexml" id="S3.E3.m2.1.1.1.1.3.cmml" xref="S3.E3.m2.1.1.1.1.3">absent</csymbol><apply id="S3.E3.m2.1.1.1.1.1.cmml" xref="S3.E3.m2.1.1.1.1.1"><times id="S3.E3.m2.1.1.1.1.1.2.cmml" xref="S3.E3.m2.1.1.1.1.1.2"></times><ci id="S3.E3.m2.1.1.1.1.1.3a.cmml" xref="S3.E3.m2.1.1.1.1.1.3"><mtext id="S3.E3.m2.1.1.1.1.1.3.cmml" mathsize="90%" xref="S3.E3.m2.1.1.1.1.1.3">LN</mtext></ci><apply id="S3.E3.m2.1.1.1.1.1.1.1.1.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1"><plus id="S3.E3.m2.1.1.1.1.1.1.1.1.1.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.1"></plus><apply id="S3.E3.m2.1.1.1.1.1.1.1.1.2.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2"><csymbol cd="ambiguous" id="S3.E3.m2.1.1.1.1.1.1.1.1.2.1.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2">superscript</csymbol><apply id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2"><csymbol cd="ambiguous" id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.1.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2">subscript</csymbol><ci id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.2.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.2">𝑯</ci><apply id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3"><times id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.1.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.1"></times><ci id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.2.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.2">𝑠</ci><ci id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.3.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.3">𝑝</ci><ci id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.4.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.4">𝑜</ci><ci id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.5.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.5">𝑜</ci><ci id="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.6.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.2.3.6">𝑓</ci></apply></apply><ci id="S3.E3.m2.1.1.1.1.1.1.1.1.2.3.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.2.3">𝑛</ci></apply><apply id="S3.E3.m2.1.1.1.1.1.1.1.1.3.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3"><csymbol cd="ambiguous" id="S3.E3.m2.1.1.1.1.1.1.1.1.3.1.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3">superscript</csymbol><apply id="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3"><csymbol cd="ambiguous" id="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.1.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3">subscript</csymbol><ci id="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.2.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.2">𝑯</ci><apply id="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3"><times id="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.1.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.1"></times><ci id="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.2.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.2">𝑎</ci><ci id="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.3.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.3">𝑠</ci><ci id="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.4.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3.2.3.4">𝑣</ci></apply></apply><ci id="S3.E3.m2.1.1.1.1.1.1.1.1.3.3.cmml" xref="S3.E3.m2.1.1.1.1.1.1.1.1.3.3">𝑛</ci></apply></apply></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E3.m2.1c">\displaystyle=\text{LN}(\boldsymbol{H}_{spoof}^{n}+\boldsymbol{H}_{asv}^{n}),</annotation><annotation encoding="application/x-llamapun" id="S3.E3.m2.1d">= LN ( bold_italic_H start_POSTSUBSCRIPT italic_s italic_p italic_o italic_o italic_f end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT + bold_italic_H start_POSTSUBSCRIPT italic_a italic_s italic_v end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) ,</annotation></semantics></math></td> <td class="ltx_eqn_cell ltx_eqn_center_padright"></td> <td class="ltx_eqn_cell ltx_eqn_eqno ltx_align_middle ltx_align_right" rowspan="1"><span class="ltx_tag ltx_tag_equation ltx_align_right">(3)</span></td> </tr></tbody> </table> <p class="ltx_p" id="S3.SS2.SSS1.p1.8"><span class="ltx_text" id="S3.SS2.SSS1.p1.8.1" style="font-size:90%;">where </span><math alttext="\boldsymbol{H}_{left}^{n}" class="ltx_Math" display="inline" id="S3.SS2.SSS1.p1.2.m1.1"><semantics id="S3.SS2.SSS1.p1.2.m1.1a"><msubsup id="S3.SS2.SSS1.p1.2.m1.1.1" xref="S3.SS2.SSS1.p1.2.m1.1.1.cmml"><mi id="S3.SS2.SSS1.p1.2.m1.1.1.2.2" mathsize="90%" xref="S3.SS2.SSS1.p1.2.m1.1.1.2.2.cmml">𝑯</mi><mrow id="S3.SS2.SSS1.p1.2.m1.1.1.2.3" xref="S3.SS2.SSS1.p1.2.m1.1.1.2.3.cmml"><mi id="S3.SS2.SSS1.p1.2.m1.1.1.2.3.2" mathsize="90%" xref="S3.SS2.SSS1.p1.2.m1.1.1.2.3.2.cmml">l</mi><mo id="S3.SS2.SSS1.p1.2.m1.1.1.2.3.1" xref="S3.SS2.SSS1.p1.2.m1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p1.2.m1.1.1.2.3.3" mathsize="90%" xref="S3.SS2.SSS1.p1.2.m1.1.1.2.3.3.cmml">e</mi><mo id="S3.SS2.SSS1.p1.2.m1.1.1.2.3.1a" xref="S3.SS2.SSS1.p1.2.m1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p1.2.m1.1.1.2.3.4" mathsize="90%" xref="S3.SS2.SSS1.p1.2.m1.1.1.2.3.4.cmml">f</mi><mo id="S3.SS2.SSS1.p1.2.m1.1.1.2.3.1b" xref="S3.SS2.SSS1.p1.2.m1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p1.2.m1.1.1.2.3.5" mathsize="90%" xref="S3.SS2.SSS1.p1.2.m1.1.1.2.3.5.cmml">t</mi></mrow><mi id="S3.SS2.SSS1.p1.2.m1.1.1.3" mathsize="90%" xref="S3.SS2.SSS1.p1.2.m1.1.1.3.cmml">n</mi></msubsup><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS1.p1.2.m1.1b"><apply id="S3.SS2.SSS1.p1.2.m1.1.1.cmml" xref="S3.SS2.SSS1.p1.2.m1.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS1.p1.2.m1.1.1.1.cmml" xref="S3.SS2.SSS1.p1.2.m1.1.1">superscript</csymbol><apply id="S3.SS2.SSS1.p1.2.m1.1.1.2.cmml" xref="S3.SS2.SSS1.p1.2.m1.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS1.p1.2.m1.1.1.2.1.cmml" xref="S3.SS2.SSS1.p1.2.m1.1.1">subscript</csymbol><ci id="S3.SS2.SSS1.p1.2.m1.1.1.2.2.cmml" xref="S3.SS2.SSS1.p1.2.m1.1.1.2.2">𝑯</ci><apply id="S3.SS2.SSS1.p1.2.m1.1.1.2.3.cmml" xref="S3.SS2.SSS1.p1.2.m1.1.1.2.3"><times id="S3.SS2.SSS1.p1.2.m1.1.1.2.3.1.cmml" xref="S3.SS2.SSS1.p1.2.m1.1.1.2.3.1"></times><ci id="S3.SS2.SSS1.p1.2.m1.1.1.2.3.2.cmml" xref="S3.SS2.SSS1.p1.2.m1.1.1.2.3.2">𝑙</ci><ci id="S3.SS2.SSS1.p1.2.m1.1.1.2.3.3.cmml" xref="S3.SS2.SSS1.p1.2.m1.1.1.2.3.3">𝑒</ci><ci id="S3.SS2.SSS1.p1.2.m1.1.1.2.3.4.cmml" xref="S3.SS2.SSS1.p1.2.m1.1.1.2.3.4">𝑓</ci><ci id="S3.SS2.SSS1.p1.2.m1.1.1.2.3.5.cmml" xref="S3.SS2.SSS1.p1.2.m1.1.1.2.3.5">𝑡</ci></apply></apply><ci id="S3.SS2.SSS1.p1.2.m1.1.1.3.cmml" xref="S3.SS2.SSS1.p1.2.m1.1.1.3">𝑛</ci></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS1.p1.2.m1.1c">\boldsymbol{H}_{left}^{n}</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS1.p1.2.m1.1d">bold_italic_H start_POSTSUBSCRIPT italic_l italic_e italic_f italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS1.p1.8.2" style="font-size:90%;">, </span><math alttext="\boldsymbol{H}_{right}^{n}" class="ltx_Math" display="inline" id="S3.SS2.SSS1.p1.3.m2.1"><semantics id="S3.SS2.SSS1.p1.3.m2.1a"><msubsup id="S3.SS2.SSS1.p1.3.m2.1.1" xref="S3.SS2.SSS1.p1.3.m2.1.1.cmml"><mi id="S3.SS2.SSS1.p1.3.m2.1.1.2.2" mathsize="90%" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.2.cmml">𝑯</mi><mrow id="S3.SS2.SSS1.p1.3.m2.1.1.2.3" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.3.cmml"><mi id="S3.SS2.SSS1.p1.3.m2.1.1.2.3.2" mathsize="90%" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.3.2.cmml">r</mi><mo id="S3.SS2.SSS1.p1.3.m2.1.1.2.3.1" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p1.3.m2.1.1.2.3.3" mathsize="90%" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.3.3.cmml">i</mi><mo id="S3.SS2.SSS1.p1.3.m2.1.1.2.3.1a" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p1.3.m2.1.1.2.3.4" mathsize="90%" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.3.4.cmml">g</mi><mo id="S3.SS2.SSS1.p1.3.m2.1.1.2.3.1b" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p1.3.m2.1.1.2.3.5" mathsize="90%" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.3.5.cmml">h</mi><mo id="S3.SS2.SSS1.p1.3.m2.1.1.2.3.1c" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p1.3.m2.1.1.2.3.6" mathsize="90%" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.3.6.cmml">t</mi></mrow><mi id="S3.SS2.SSS1.p1.3.m2.1.1.3" mathsize="90%" xref="S3.SS2.SSS1.p1.3.m2.1.1.3.cmml">n</mi></msubsup><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS1.p1.3.m2.1b"><apply id="S3.SS2.SSS1.p1.3.m2.1.1.cmml" xref="S3.SS2.SSS1.p1.3.m2.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS1.p1.3.m2.1.1.1.cmml" xref="S3.SS2.SSS1.p1.3.m2.1.1">superscript</csymbol><apply id="S3.SS2.SSS1.p1.3.m2.1.1.2.cmml" xref="S3.SS2.SSS1.p1.3.m2.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS1.p1.3.m2.1.1.2.1.cmml" xref="S3.SS2.SSS1.p1.3.m2.1.1">subscript</csymbol><ci id="S3.SS2.SSS1.p1.3.m2.1.1.2.2.cmml" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.2">𝑯</ci><apply id="S3.SS2.SSS1.p1.3.m2.1.1.2.3.cmml" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.3"><times id="S3.SS2.SSS1.p1.3.m2.1.1.2.3.1.cmml" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.3.1"></times><ci id="S3.SS2.SSS1.p1.3.m2.1.1.2.3.2.cmml" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.3.2">𝑟</ci><ci id="S3.SS2.SSS1.p1.3.m2.1.1.2.3.3.cmml" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.3.3">𝑖</ci><ci id="S3.SS2.SSS1.p1.3.m2.1.1.2.3.4.cmml" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.3.4">𝑔</ci><ci id="S3.SS2.SSS1.p1.3.m2.1.1.2.3.5.cmml" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.3.5">ℎ</ci><ci id="S3.SS2.SSS1.p1.3.m2.1.1.2.3.6.cmml" xref="S3.SS2.SSS1.p1.3.m2.1.1.2.3.6">𝑡</ci></apply></apply><ci id="S3.SS2.SSS1.p1.3.m2.1.1.3.cmml" xref="S3.SS2.SSS1.p1.3.m2.1.1.3">𝑛</ci></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS1.p1.3.m2.1c">\boldsymbol{H}_{right}^{n}</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS1.p1.3.m2.1d">bold_italic_H start_POSTSUBSCRIPT italic_r italic_i italic_g italic_h italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS1.p1.8.3" style="font-size:90%;">, and </span><math alttext="\boldsymbol{H}_{fuse}^{n}" class="ltx_Math" display="inline" id="S3.SS2.SSS1.p1.4.m3.1"><semantics id="S3.SS2.SSS1.p1.4.m3.1a"><msubsup id="S3.SS2.SSS1.p1.4.m3.1.1" xref="S3.SS2.SSS1.p1.4.m3.1.1.cmml"><mi id="S3.SS2.SSS1.p1.4.m3.1.1.2.2" mathsize="90%" xref="S3.SS2.SSS1.p1.4.m3.1.1.2.2.cmml">𝑯</mi><mrow id="S3.SS2.SSS1.p1.4.m3.1.1.2.3" xref="S3.SS2.SSS1.p1.4.m3.1.1.2.3.cmml"><mi id="S3.SS2.SSS1.p1.4.m3.1.1.2.3.2" mathsize="90%" xref="S3.SS2.SSS1.p1.4.m3.1.1.2.3.2.cmml">f</mi><mo id="S3.SS2.SSS1.p1.4.m3.1.1.2.3.1" xref="S3.SS2.SSS1.p1.4.m3.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p1.4.m3.1.1.2.3.3" mathsize="90%" xref="S3.SS2.SSS1.p1.4.m3.1.1.2.3.3.cmml">u</mi><mo id="S3.SS2.SSS1.p1.4.m3.1.1.2.3.1a" xref="S3.SS2.SSS1.p1.4.m3.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p1.4.m3.1.1.2.3.4" mathsize="90%" xref="S3.SS2.SSS1.p1.4.m3.1.1.2.3.4.cmml">s</mi><mo id="S3.SS2.SSS1.p1.4.m3.1.1.2.3.1b" xref="S3.SS2.SSS1.p1.4.m3.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p1.4.m3.1.1.2.3.5" mathsize="90%" xref="S3.SS2.SSS1.p1.4.m3.1.1.2.3.5.cmml">e</mi></mrow><mi id="S3.SS2.SSS1.p1.4.m3.1.1.3" mathsize="90%" xref="S3.SS2.SSS1.p1.4.m3.1.1.3.cmml">n</mi></msubsup><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS1.p1.4.m3.1b"><apply id="S3.SS2.SSS1.p1.4.m3.1.1.cmml" xref="S3.SS2.SSS1.p1.4.m3.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS1.p1.4.m3.1.1.1.cmml" xref="S3.SS2.SSS1.p1.4.m3.1.1">superscript</csymbol><apply id="S3.SS2.SSS1.p1.4.m3.1.1.2.cmml" xref="S3.SS2.SSS1.p1.4.m3.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS1.p1.4.m3.1.1.2.1.cmml" xref="S3.SS2.SSS1.p1.4.m3.1.1">subscript</csymbol><ci id="S3.SS2.SSS1.p1.4.m3.1.1.2.2.cmml" xref="S3.SS2.SSS1.p1.4.m3.1.1.2.2">𝑯</ci><apply id="S3.SS2.SSS1.p1.4.m3.1.1.2.3.cmml" xref="S3.SS2.SSS1.p1.4.m3.1.1.2.3"><times id="S3.SS2.SSS1.p1.4.m3.1.1.2.3.1.cmml" xref="S3.SS2.SSS1.p1.4.m3.1.1.2.3.1"></times><ci id="S3.SS2.SSS1.p1.4.m3.1.1.2.3.2.cmml" xref="S3.SS2.SSS1.p1.4.m3.1.1.2.3.2">𝑓</ci><ci id="S3.SS2.SSS1.p1.4.m3.1.1.2.3.3.cmml" xref="S3.SS2.SSS1.p1.4.m3.1.1.2.3.3">𝑢</ci><ci id="S3.SS2.SSS1.p1.4.m3.1.1.2.3.4.cmml" xref="S3.SS2.SSS1.p1.4.m3.1.1.2.3.4">𝑠</ci><ci id="S3.SS2.SSS1.p1.4.m3.1.1.2.3.5.cmml" xref="S3.SS2.SSS1.p1.4.m3.1.1.2.3.5">𝑒</ci></apply></apply><ci id="S3.SS2.SSS1.p1.4.m3.1.1.3.cmml" xref="S3.SS2.SSS1.p1.4.m3.1.1.3">𝑛</ci></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS1.p1.4.m3.1c">\boldsymbol{H}_{fuse}^{n}</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS1.p1.4.m3.1d">bold_italic_H start_POSTSUBSCRIPT italic_f italic_u italic_s italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS1.p1.8.4" style="font-size:90%;"> represent the outputs of the left branch, right branch, and the fusion operation, respectively, within the </span><math alttext="n" class="ltx_Math" display="inline" id="S3.SS2.SSS1.p1.5.m4.1"><semantics id="S3.SS2.SSS1.p1.5.m4.1a"><mi id="S3.SS2.SSS1.p1.5.m4.1.1" mathsize="90%" xref="S3.SS2.SSS1.p1.5.m4.1.1.cmml">n</mi><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS1.p1.5.m4.1b"><ci id="S3.SS2.SSS1.p1.5.m4.1.1.cmml" xref="S3.SS2.SSS1.p1.5.m4.1.1">𝑛</ci></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS1.p1.5.m4.1c">n</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS1.p1.5.m4.1d">italic_n</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS1.p1.8.5" style="font-size:90%;">-th block. The functions </span><math alttext="f_{left}(\cdot)" class="ltx_Math" display="inline" id="S3.SS2.SSS1.p1.6.m5.1"><semantics id="S3.SS2.SSS1.p1.6.m5.1a"><mrow id="S3.SS2.SSS1.p1.6.m5.1.2" xref="S3.SS2.SSS1.p1.6.m5.1.2.cmml"><msub id="S3.SS2.SSS1.p1.6.m5.1.2.2" xref="S3.SS2.SSS1.p1.6.m5.1.2.2.cmml"><mi id="S3.SS2.SSS1.p1.6.m5.1.2.2.2" mathsize="90%" xref="S3.SS2.SSS1.p1.6.m5.1.2.2.2.cmml">f</mi><mrow id="S3.SS2.SSS1.p1.6.m5.1.2.2.3" xref="S3.SS2.SSS1.p1.6.m5.1.2.2.3.cmml"><mi id="S3.SS2.SSS1.p1.6.m5.1.2.2.3.2" mathsize="90%" xref="S3.SS2.SSS1.p1.6.m5.1.2.2.3.2.cmml">l</mi><mo id="S3.SS2.SSS1.p1.6.m5.1.2.2.3.1" xref="S3.SS2.SSS1.p1.6.m5.1.2.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p1.6.m5.1.2.2.3.3" mathsize="90%" xref="S3.SS2.SSS1.p1.6.m5.1.2.2.3.3.cmml">e</mi><mo id="S3.SS2.SSS1.p1.6.m5.1.2.2.3.1a" xref="S3.SS2.SSS1.p1.6.m5.1.2.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p1.6.m5.1.2.2.3.4" mathsize="90%" xref="S3.SS2.SSS1.p1.6.m5.1.2.2.3.4.cmml">f</mi><mo id="S3.SS2.SSS1.p1.6.m5.1.2.2.3.1b" xref="S3.SS2.SSS1.p1.6.m5.1.2.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p1.6.m5.1.2.2.3.5" mathsize="90%" xref="S3.SS2.SSS1.p1.6.m5.1.2.2.3.5.cmml">t</mi></mrow></msub><mo id="S3.SS2.SSS1.p1.6.m5.1.2.1" xref="S3.SS2.SSS1.p1.6.m5.1.2.1.cmml">⁢</mo><mrow id="S3.SS2.SSS1.p1.6.m5.1.2.3.2" xref="S3.SS2.SSS1.p1.6.m5.1.2.cmml"><mo id="S3.SS2.SSS1.p1.6.m5.1.2.3.2.1" maxsize="90%" minsize="90%" xref="S3.SS2.SSS1.p1.6.m5.1.2.cmml">(</mo><mo id="S3.SS2.SSS1.p1.6.m5.1.1" lspace="0em" mathsize="90%" rspace="0em" xref="S3.SS2.SSS1.p1.6.m5.1.1.cmml">⋅</mo><mo id="S3.SS2.SSS1.p1.6.m5.1.2.3.2.2" maxsize="90%" minsize="90%" xref="S3.SS2.SSS1.p1.6.m5.1.2.cmml">)</mo></mrow></mrow><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS1.p1.6.m5.1b"><apply id="S3.SS2.SSS1.p1.6.m5.1.2.cmml" xref="S3.SS2.SSS1.p1.6.m5.1.2"><times id="S3.SS2.SSS1.p1.6.m5.1.2.1.cmml" xref="S3.SS2.SSS1.p1.6.m5.1.2.1"></times><apply id="S3.SS2.SSS1.p1.6.m5.1.2.2.cmml" xref="S3.SS2.SSS1.p1.6.m5.1.2.2"><csymbol cd="ambiguous" id="S3.SS2.SSS1.p1.6.m5.1.2.2.1.cmml" xref="S3.SS2.SSS1.p1.6.m5.1.2.2">subscript</csymbol><ci id="S3.SS2.SSS1.p1.6.m5.1.2.2.2.cmml" xref="S3.SS2.SSS1.p1.6.m5.1.2.2.2">𝑓</ci><apply id="S3.SS2.SSS1.p1.6.m5.1.2.2.3.cmml" xref="S3.SS2.SSS1.p1.6.m5.1.2.2.3"><times id="S3.SS2.SSS1.p1.6.m5.1.2.2.3.1.cmml" xref="S3.SS2.SSS1.p1.6.m5.1.2.2.3.1"></times><ci id="S3.SS2.SSS1.p1.6.m5.1.2.2.3.2.cmml" xref="S3.SS2.SSS1.p1.6.m5.1.2.2.3.2">𝑙</ci><ci id="S3.SS2.SSS1.p1.6.m5.1.2.2.3.3.cmml" xref="S3.SS2.SSS1.p1.6.m5.1.2.2.3.3">𝑒</ci><ci id="S3.SS2.SSS1.p1.6.m5.1.2.2.3.4.cmml" xref="S3.SS2.SSS1.p1.6.m5.1.2.2.3.4">𝑓</ci><ci id="S3.SS2.SSS1.p1.6.m5.1.2.2.3.5.cmml" xref="S3.SS2.SSS1.p1.6.m5.1.2.2.3.5">𝑡</ci></apply></apply><ci id="S3.SS2.SSS1.p1.6.m5.1.1.cmml" xref="S3.SS2.SSS1.p1.6.m5.1.1">⋅</ci></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS1.p1.6.m5.1c">f_{left}(\cdot)</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS1.p1.6.m5.1d">italic_f start_POSTSUBSCRIPT italic_l italic_e italic_f italic_t end_POSTSUBSCRIPT ( ⋅ )</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS1.p1.8.6" style="font-size:90%;"> and </span><math alttext="f_{right}(\cdot)" class="ltx_Math" display="inline" id="S3.SS2.SSS1.p1.7.m6.1"><semantics id="S3.SS2.SSS1.p1.7.m6.1a"><mrow id="S3.SS2.SSS1.p1.7.m6.1.2" xref="S3.SS2.SSS1.p1.7.m6.1.2.cmml"><msub id="S3.SS2.SSS1.p1.7.m6.1.2.2" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.cmml"><mi id="S3.SS2.SSS1.p1.7.m6.1.2.2.2" mathsize="90%" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.2.cmml">f</mi><mrow id="S3.SS2.SSS1.p1.7.m6.1.2.2.3" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.3.cmml"><mi id="S3.SS2.SSS1.p1.7.m6.1.2.2.3.2" mathsize="90%" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.3.2.cmml">r</mi><mo id="S3.SS2.SSS1.p1.7.m6.1.2.2.3.1" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p1.7.m6.1.2.2.3.3" mathsize="90%" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.3.3.cmml">i</mi><mo id="S3.SS2.SSS1.p1.7.m6.1.2.2.3.1a" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p1.7.m6.1.2.2.3.4" mathsize="90%" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.3.4.cmml">g</mi><mo id="S3.SS2.SSS1.p1.7.m6.1.2.2.3.1b" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p1.7.m6.1.2.2.3.5" mathsize="90%" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.3.5.cmml">h</mi><mo id="S3.SS2.SSS1.p1.7.m6.1.2.2.3.1c" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p1.7.m6.1.2.2.3.6" mathsize="90%" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.3.6.cmml">t</mi></mrow></msub><mo id="S3.SS2.SSS1.p1.7.m6.1.2.1" xref="S3.SS2.SSS1.p1.7.m6.1.2.1.cmml">⁢</mo><mrow id="S3.SS2.SSS1.p1.7.m6.1.2.3.2" xref="S3.SS2.SSS1.p1.7.m6.1.2.cmml"><mo id="S3.SS2.SSS1.p1.7.m6.1.2.3.2.1" maxsize="90%" minsize="90%" xref="S3.SS2.SSS1.p1.7.m6.1.2.cmml">(</mo><mo id="S3.SS2.SSS1.p1.7.m6.1.1" lspace="0em" mathsize="90%" rspace="0em" xref="S3.SS2.SSS1.p1.7.m6.1.1.cmml">⋅</mo><mo id="S3.SS2.SSS1.p1.7.m6.1.2.3.2.2" maxsize="90%" minsize="90%" xref="S3.SS2.SSS1.p1.7.m6.1.2.cmml">)</mo></mrow></mrow><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS1.p1.7.m6.1b"><apply id="S3.SS2.SSS1.p1.7.m6.1.2.cmml" xref="S3.SS2.SSS1.p1.7.m6.1.2"><times id="S3.SS2.SSS1.p1.7.m6.1.2.1.cmml" xref="S3.SS2.SSS1.p1.7.m6.1.2.1"></times><apply id="S3.SS2.SSS1.p1.7.m6.1.2.2.cmml" xref="S3.SS2.SSS1.p1.7.m6.1.2.2"><csymbol cd="ambiguous" id="S3.SS2.SSS1.p1.7.m6.1.2.2.1.cmml" xref="S3.SS2.SSS1.p1.7.m6.1.2.2">subscript</csymbol><ci id="S3.SS2.SSS1.p1.7.m6.1.2.2.2.cmml" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.2">𝑓</ci><apply id="S3.SS2.SSS1.p1.7.m6.1.2.2.3.cmml" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.3"><times id="S3.SS2.SSS1.p1.7.m6.1.2.2.3.1.cmml" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.3.1"></times><ci id="S3.SS2.SSS1.p1.7.m6.1.2.2.3.2.cmml" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.3.2">𝑟</ci><ci id="S3.SS2.SSS1.p1.7.m6.1.2.2.3.3.cmml" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.3.3">𝑖</ci><ci id="S3.SS2.SSS1.p1.7.m6.1.2.2.3.4.cmml" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.3.4">𝑔</ci><ci id="S3.SS2.SSS1.p1.7.m6.1.2.2.3.5.cmml" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.3.5">ℎ</ci><ci id="S3.SS2.SSS1.p1.7.m6.1.2.2.3.6.cmml" xref="S3.SS2.SSS1.p1.7.m6.1.2.2.3.6">𝑡</ci></apply></apply><ci id="S3.SS2.SSS1.p1.7.m6.1.1.cmml" xref="S3.SS2.SSS1.p1.7.m6.1.1">⋅</ci></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS1.p1.7.m6.1c">f_{right}(\cdot)</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS1.p1.7.m6.1d">italic_f start_POSTSUBSCRIPT italic_r italic_i italic_g italic_h italic_t end_POSTSUBSCRIPT ( ⋅ )</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS1.p1.8.7" style="font-size:90%;"> correspond to the transformations in the two branches, while </span><math alttext="\text{LN}(\cdot)" class="ltx_Math" display="inline" id="S3.SS2.SSS1.p1.8.m7.1"><semantics id="S3.SS2.SSS1.p1.8.m7.1a"><mrow id="S3.SS2.SSS1.p1.8.m7.1.2" xref="S3.SS2.SSS1.p1.8.m7.1.2.cmml"><mtext id="S3.SS2.SSS1.p1.8.m7.1.2.2" mathsize="90%" xref="S3.SS2.SSS1.p1.8.m7.1.2.2a.cmml">LN</mtext><mo id="S3.SS2.SSS1.p1.8.m7.1.2.1" xref="S3.SS2.SSS1.p1.8.m7.1.2.1.cmml">⁢</mo><mrow id="S3.SS2.SSS1.p1.8.m7.1.2.3.2" xref="S3.SS2.SSS1.p1.8.m7.1.2.cmml"><mo id="S3.SS2.SSS1.p1.8.m7.1.2.3.2.1" maxsize="90%" minsize="90%" xref="S3.SS2.SSS1.p1.8.m7.1.2.cmml">(</mo><mo id="S3.SS2.SSS1.p1.8.m7.1.1" lspace="0em" mathsize="90%" rspace="0em" xref="S3.SS2.SSS1.p1.8.m7.1.1.cmml">⋅</mo><mo id="S3.SS2.SSS1.p1.8.m7.1.2.3.2.2" maxsize="90%" minsize="90%" xref="S3.SS2.SSS1.p1.8.m7.1.2.cmml">)</mo></mrow></mrow><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS1.p1.8.m7.1b"><apply id="S3.SS2.SSS1.p1.8.m7.1.2.cmml" xref="S3.SS2.SSS1.p1.8.m7.1.2"><times id="S3.SS2.SSS1.p1.8.m7.1.2.1.cmml" xref="S3.SS2.SSS1.p1.8.m7.1.2.1"></times><ci id="S3.SS2.SSS1.p1.8.m7.1.2.2a.cmml" xref="S3.SS2.SSS1.p1.8.m7.1.2.2"><mtext id="S3.SS2.SSS1.p1.8.m7.1.2.2.cmml" mathsize="90%" xref="S3.SS2.SSS1.p1.8.m7.1.2.2">LN</mtext></ci><ci id="S3.SS2.SSS1.p1.8.m7.1.1.cmml" xref="S3.SS2.SSS1.p1.8.m7.1.1">⋅</ci></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS1.p1.8.m7.1c">\text{LN}(\cdot)</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS1.p1.8.m7.1d">LN ( ⋅ )</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS1.p1.8.8" style="font-size:90%;"> signifies the layer normalization, identical to the one used in </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S3.SS2.SSS1.p1.8.9.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib41" title="">41</a><span class="ltx_text" id="S3.SS2.SSS1.p1.8.10.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S3.SS2.SSS1.p1.8.11" style="font-size:90%;">.</span></p> </div> <div class="ltx_para" id="S3.SS2.SSS1.p2"> <p class="ltx_p" id="S3.SS2.SSS1.p2.3"><span class="ltx_text" id="S3.SS2.SSS1.p2.3.1" style="font-size:90%;">Furthermore, a one-dimensional convolutional layer was employed to operate on the output of the fusion operation, enhancing the non-linearity of the entire block. In contrast to the use of fully connected layers in </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S3.SS2.SSS1.p2.3.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib41" title="">41</a><span class="ltx_text" id="S3.SS2.SSS1.p2.3.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S3.SS2.SSS1.p2.3.4" style="font-size:90%;">, we contend that the local correlations in speech features need to be emphasized. Thus, drawing inspiration from the operations described in FastSpeech </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S3.SS2.SSS1.p2.3.5.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib42" title="">42</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib43" title="">43</a><span class="ltx_text" id="S3.SS2.SSS1.p2.3.6.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S3.SS2.SSS1.p2.3.7" style="font-size:90%;">, convolutional layers were incorporated to capture these local correlations. Additionally, residual connections and layer normalization were utilized to facilitate more stable training of the model. The entire computational process is illustrated as follows:</span></p> <table class="ltx_equationgroup ltx_eqn_align ltx_eqn_table" id="S6.EGx2"> <tbody id="S3.E4"><tr class="ltx_equation ltx_eqn_row ltx_align_baseline"> <td class="ltx_eqn_cell ltx_eqn_center_padleft"></td> <td class="ltx_td ltx_align_right ltx_eqn_cell"><math alttext="\displaystyle\boldsymbol{H}_{final}^{n}=\text{LN}((\text{ReLU}(\text{Conv}(% \boldsymbol{H}_{fuse}^{n}))+\boldsymbol{H}_{fuse}^{n})," class="ltx_math_unparsed" display="inline" id="S3.E4.m1.1"><semantics id="S3.E4.m1.1a"><mrow id="S3.E4.m1.1b"><msubsup id="S3.E4.m1.1.1"><mi id="S3.E4.m1.1.1.2.2" mathsize="90%">𝑯</mi><mrow id="S3.E4.m1.1.1.2.3"><mi id="S3.E4.m1.1.1.2.3.2" mathsize="90%">f</mi><mo id="S3.E4.m1.1.1.2.3.1">⁢</mo><mi id="S3.E4.m1.1.1.2.3.3" mathsize="90%">i</mi><mo id="S3.E4.m1.1.1.2.3.1a">⁢</mo><mi id="S3.E4.m1.1.1.2.3.4" mathsize="90%">n</mi><mo id="S3.E4.m1.1.1.2.3.1b">⁢</mo><mi id="S3.E4.m1.1.1.2.3.5" mathsize="90%">a</mi><mo id="S3.E4.m1.1.1.2.3.1c">⁢</mo><mi id="S3.E4.m1.1.1.2.3.6" mathsize="90%">l</mi></mrow><mi id="S3.E4.m1.1.1.3" mathsize="90%">n</mi></msubsup><mo id="S3.E4.m1.1.2" mathsize="90%">=</mo><mtext id="S3.E4.m1.1.3" mathsize="90%">LN</mtext><mrow id="S3.E4.m1.1.4"><mo id="S3.E4.m1.1.4.1" maxsize="90%" minsize="90%">(</mo><mrow id="S3.E4.m1.1.4.2"><mo id="S3.E4.m1.1.4.2.1" maxsize="90%" minsize="90%">(</mo><mtext id="S3.E4.m1.1.4.2.2" mathsize="90%">ReLU</mtext><mrow id="S3.E4.m1.1.4.2.3"><mo id="S3.E4.m1.1.4.2.3.1" maxsize="90%" minsize="90%">(</mo><mtext id="S3.E4.m1.1.4.2.3.2" mathsize="90%">Conv</mtext><mrow id="S3.E4.m1.1.4.2.3.3"><mo id="S3.E4.m1.1.4.2.3.3.1" maxsize="90%" minsize="90%">(</mo><msubsup id="S3.E4.m1.1.4.2.3.3.2"><mi id="S3.E4.m1.1.4.2.3.3.2.2.2" mathsize="90%">𝑯</mi><mrow id="S3.E4.m1.1.4.2.3.3.2.2.3"><mi id="S3.E4.m1.1.4.2.3.3.2.2.3.2" mathsize="90%">f</mi><mo id="S3.E4.m1.1.4.2.3.3.2.2.3.1">⁢</mo><mi id="S3.E4.m1.1.4.2.3.3.2.2.3.3" mathsize="90%">u</mi><mo id="S3.E4.m1.1.4.2.3.3.2.2.3.1a">⁢</mo><mi id="S3.E4.m1.1.4.2.3.3.2.2.3.4" mathsize="90%">s</mi><mo id="S3.E4.m1.1.4.2.3.3.2.2.3.1b">⁢</mo><mi id="S3.E4.m1.1.4.2.3.3.2.2.3.5" mathsize="90%">e</mi></mrow><mi id="S3.E4.m1.1.4.2.3.3.2.3" mathsize="90%">n</mi></msubsup><mo id="S3.E4.m1.1.4.2.3.3.3" maxsize="90%" minsize="90%">)</mo></mrow><mo id="S3.E4.m1.1.4.2.3.4" maxsize="90%" minsize="90%">)</mo></mrow><mo id="S3.E4.m1.1.4.2.4" mathsize="90%">+</mo><msubsup id="S3.E4.m1.1.4.2.5"><mi id="S3.E4.m1.1.4.2.5.2.2" mathsize="90%">𝑯</mi><mrow id="S3.E4.m1.1.4.2.5.2.3"><mi id="S3.E4.m1.1.4.2.5.2.3.2" mathsize="90%">f</mi><mo id="S3.E4.m1.1.4.2.5.2.3.1">⁢</mo><mi id="S3.E4.m1.1.4.2.5.2.3.3" mathsize="90%">u</mi><mo id="S3.E4.m1.1.4.2.5.2.3.1a">⁢</mo><mi id="S3.E4.m1.1.4.2.5.2.3.4" mathsize="90%">s</mi><mo id="S3.E4.m1.1.4.2.5.2.3.1b">⁢</mo><mi id="S3.E4.m1.1.4.2.5.2.3.5" mathsize="90%">e</mi></mrow><mi id="S3.E4.m1.1.4.2.5.3" mathsize="90%">n</mi></msubsup><mo id="S3.E4.m1.1.4.2.6" maxsize="90%" minsize="90%">)</mo></mrow><mo id="S3.E4.m1.1.4.3" mathsize="90%">,</mo></mrow></mrow><annotation encoding="application/x-tex" id="S3.E4.m1.1c">\displaystyle\boldsymbol{H}_{final}^{n}=\text{LN}((\text{ReLU}(\text{Conv}(% \boldsymbol{H}_{fuse}^{n}))+\boldsymbol{H}_{fuse}^{n}),</annotation><annotation encoding="application/x-llamapun" id="S3.E4.m1.1d">bold_italic_H start_POSTSUBSCRIPT italic_f italic_i italic_n italic_a italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT = LN ( ( ReLU ( Conv ( bold_italic_H start_POSTSUBSCRIPT italic_f italic_u italic_s italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) ) + bold_italic_H start_POSTSUBSCRIPT italic_f italic_u italic_s italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) ,</annotation></semantics></math></td> <td class="ltx_eqn_cell ltx_eqn_center_padright"></td> <td class="ltx_eqn_cell ltx_eqn_eqno ltx_align_middle ltx_align_right" rowspan="1"><span class="ltx_tag ltx_tag_equation ltx_align_right">(4)</span></td> </tr></tbody> </table> <p class="ltx_p" id="S3.SS2.SSS1.p2.2"><span class="ltx_text" id="S3.SS2.SSS1.p2.2.1" style="font-size:90%;">where </span><math alttext="\boldsymbol{H}_{final}^{n}" class="ltx_Math" display="inline" id="S3.SS2.SSS1.p2.1.m1.1"><semantics id="S3.SS2.SSS1.p2.1.m1.1a"><msubsup id="S3.SS2.SSS1.p2.1.m1.1.1" xref="S3.SS2.SSS1.p2.1.m1.1.1.cmml"><mi id="S3.SS2.SSS1.p2.1.m1.1.1.2.2" mathsize="90%" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.2.cmml">𝑯</mi><mrow id="S3.SS2.SSS1.p2.1.m1.1.1.2.3" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.3.cmml"><mi id="S3.SS2.SSS1.p2.1.m1.1.1.2.3.2" mathsize="90%" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.3.2.cmml">f</mi><mo id="S3.SS2.SSS1.p2.1.m1.1.1.2.3.1" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p2.1.m1.1.1.2.3.3" mathsize="90%" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.3.3.cmml">i</mi><mo id="S3.SS2.SSS1.p2.1.m1.1.1.2.3.1a" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p2.1.m1.1.1.2.3.4" mathsize="90%" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.3.4.cmml">n</mi><mo id="S3.SS2.SSS1.p2.1.m1.1.1.2.3.1b" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p2.1.m1.1.1.2.3.5" mathsize="90%" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.3.5.cmml">a</mi><mo id="S3.SS2.SSS1.p2.1.m1.1.1.2.3.1c" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS1.p2.1.m1.1.1.2.3.6" mathsize="90%" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.3.6.cmml">l</mi></mrow><mi id="S3.SS2.SSS1.p2.1.m1.1.1.3" mathsize="90%" xref="S3.SS2.SSS1.p2.1.m1.1.1.3.cmml">n</mi></msubsup><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS1.p2.1.m1.1b"><apply id="S3.SS2.SSS1.p2.1.m1.1.1.cmml" xref="S3.SS2.SSS1.p2.1.m1.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS1.p2.1.m1.1.1.1.cmml" xref="S3.SS2.SSS1.p2.1.m1.1.1">superscript</csymbol><apply id="S3.SS2.SSS1.p2.1.m1.1.1.2.cmml" xref="S3.SS2.SSS1.p2.1.m1.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS1.p2.1.m1.1.1.2.1.cmml" xref="S3.SS2.SSS1.p2.1.m1.1.1">subscript</csymbol><ci id="S3.SS2.SSS1.p2.1.m1.1.1.2.2.cmml" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.2">𝑯</ci><apply id="S3.SS2.SSS1.p2.1.m1.1.1.2.3.cmml" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.3"><times id="S3.SS2.SSS1.p2.1.m1.1.1.2.3.1.cmml" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.3.1"></times><ci id="S3.SS2.SSS1.p2.1.m1.1.1.2.3.2.cmml" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.3.2">𝑓</ci><ci id="S3.SS2.SSS1.p2.1.m1.1.1.2.3.3.cmml" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.3.3">𝑖</ci><ci id="S3.SS2.SSS1.p2.1.m1.1.1.2.3.4.cmml" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.3.4">𝑛</ci><ci id="S3.SS2.SSS1.p2.1.m1.1.1.2.3.5.cmml" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.3.5">𝑎</ci><ci id="S3.SS2.SSS1.p2.1.m1.1.1.2.3.6.cmml" xref="S3.SS2.SSS1.p2.1.m1.1.1.2.3.6">𝑙</ci></apply></apply><ci id="S3.SS2.SSS1.p2.1.m1.1.1.3.cmml" xref="S3.SS2.SSS1.p2.1.m1.1.1.3">𝑛</ci></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS1.p2.1.m1.1c">\boldsymbol{H}_{final}^{n}</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS1.p2.1.m1.1d">bold_italic_H start_POSTSUBSCRIPT italic_f italic_i italic_n italic_a italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS1.p2.2.2" style="font-size:90%;"> represents the final output of the </span><math alttext="n" class="ltx_Math" display="inline" id="S3.SS2.SSS1.p2.2.m2.1"><semantics id="S3.SS2.SSS1.p2.2.m2.1a"><mi id="S3.SS2.SSS1.p2.2.m2.1.1" mathsize="90%" xref="S3.SS2.SSS1.p2.2.m2.1.1.cmml">n</mi><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS1.p2.2.m2.1b"><ci id="S3.SS2.SSS1.p2.2.m2.1.1.cmml" xref="S3.SS2.SSS1.p2.2.m2.1.1">𝑛</ci></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS1.p2.2.m2.1c">n</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS1.p2.2.m2.1d">italic_n</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS1.p2.2.3" style="font-size:90%;">-th block.</span></p> </div> </section> <section class="ltx_subsubsection" id="S3.SS2.SSS2"> <h4 class="ltx_title ltx_title_subsubsection" style="font-size:90%;"> <span class="ltx_tag ltx_tag_subsubsection">3.2.2 </span>Speaker classifier and spoof classifier</h4> <div class="ltx_para" id="S3.SS2.SSS2.p1"> <p class="ltx_p" id="S3.SS2.SSS2.p1.3"><span class="ltx_text" id="S3.SS2.SSS2.p1.3.1" style="font-size:90%;">The architectures of both the speaker classifier and the spoof classifier in the model are nearly identical, differing only in the output nodes of the final fully connected layers, which are associated with the number of classes. Drawing inspiration from the successful utilization of multi-layer feature aggregation as seen in </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S3.SS2.SSS2.p1.3.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib7" title="">7</a><span class="ltx_text" id="S3.SS2.SSS2.p1.3.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S3.SS2.SSS2.p1.3.4" style="font-size:90%;"> and </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S3.SS2.SSS2.p1.3.5.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib44" title="">44</a><span class="ltx_text" id="S3.SS2.SSS2.p1.3.6.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S3.SS2.SSS2.p1.3.7" style="font-size:90%;">, we have also incorporated this strategy to aggregate the output of each left branch and each right branch for the anti-spoofing and ASV sub-objectives, respectively. This aggregation can be expressed as follows:</span></p> <table class="ltx_equationgroup ltx_eqn_align ltx_eqn_table" id="S6.EGx3"> <tbody id="S3.E5"><tr class="ltx_equation ltx_eqn_row ltx_align_baseline"> <td class="ltx_eqn_cell ltx_eqn_center_padleft"></td> <td class="ltx_td ltx_align_right ltx_eqn_cell"><math alttext="\displaystyle\boldsymbol{H}_{spoof}" class="ltx_Math" display="inline" id="S3.E5.m1.1"><semantics id="S3.E5.m1.1a"><msub id="S3.E5.m1.1.1" xref="S3.E5.m1.1.1.cmml"><mi id="S3.E5.m1.1.1.2" mathsize="90%" xref="S3.E5.m1.1.1.2.cmml">𝑯</mi><mrow id="S3.E5.m1.1.1.3" xref="S3.E5.m1.1.1.3.cmml"><mi id="S3.E5.m1.1.1.3.2" mathsize="90%" xref="S3.E5.m1.1.1.3.2.cmml">s</mi><mo id="S3.E5.m1.1.1.3.1" xref="S3.E5.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.E5.m1.1.1.3.3" mathsize="90%" xref="S3.E5.m1.1.1.3.3.cmml">p</mi><mo id="S3.E5.m1.1.1.3.1a" xref="S3.E5.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.E5.m1.1.1.3.4" mathsize="90%" xref="S3.E5.m1.1.1.3.4.cmml">o</mi><mo id="S3.E5.m1.1.1.3.1b" xref="S3.E5.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.E5.m1.1.1.3.5" mathsize="90%" xref="S3.E5.m1.1.1.3.5.cmml">o</mi><mo id="S3.E5.m1.1.1.3.1c" xref="S3.E5.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.E5.m1.1.1.3.6" mathsize="90%" xref="S3.E5.m1.1.1.3.6.cmml">f</mi></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.E5.m1.1b"><apply id="S3.E5.m1.1.1.cmml" xref="S3.E5.m1.1.1"><csymbol cd="ambiguous" id="S3.E5.m1.1.1.1.cmml" xref="S3.E5.m1.1.1">subscript</csymbol><ci id="S3.E5.m1.1.1.2.cmml" xref="S3.E5.m1.1.1.2">𝑯</ci><apply id="S3.E5.m1.1.1.3.cmml" xref="S3.E5.m1.1.1.3"><times id="S3.E5.m1.1.1.3.1.cmml" xref="S3.E5.m1.1.1.3.1"></times><ci id="S3.E5.m1.1.1.3.2.cmml" xref="S3.E5.m1.1.1.3.2">𝑠</ci><ci id="S3.E5.m1.1.1.3.3.cmml" xref="S3.E5.m1.1.1.3.3">𝑝</ci><ci id="S3.E5.m1.1.1.3.4.cmml" xref="S3.E5.m1.1.1.3.4">𝑜</ci><ci id="S3.E5.m1.1.1.3.5.cmml" xref="S3.E5.m1.1.1.3.5">𝑜</ci><ci id="S3.E5.m1.1.1.3.6.cmml" xref="S3.E5.m1.1.1.3.6">𝑓</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E5.m1.1c">\displaystyle\boldsymbol{H}_{spoof}</annotation><annotation encoding="application/x-llamapun" id="S3.E5.m1.1d">bold_italic_H start_POSTSUBSCRIPT italic_s italic_p italic_o italic_o italic_f end_POSTSUBSCRIPT</annotation></semantics></math></td> <td class="ltx_td ltx_align_left ltx_eqn_cell"><math alttext="\displaystyle=\frac{1}{N}\sum_{n=1}^{N}\boldsymbol{H}_{left}^{n}," class="ltx_Math" display="inline" id="S3.E5.m2.1"><semantics id="S3.E5.m2.1a"><mrow id="S3.E5.m2.1.1.1" xref="S3.E5.m2.1.1.1.1.cmml"><mrow id="S3.E5.m2.1.1.1.1" xref="S3.E5.m2.1.1.1.1.cmml"><mi id="S3.E5.m2.1.1.1.1.2" xref="S3.E5.m2.1.1.1.1.2.cmml"></mi><mo id="S3.E5.m2.1.1.1.1.1" mathsize="90%" xref="S3.E5.m2.1.1.1.1.1.cmml">=</mo><mrow id="S3.E5.m2.1.1.1.1.3" xref="S3.E5.m2.1.1.1.1.3.cmml"><mstyle displaystyle="true" id="S3.E5.m2.1.1.1.1.3.2" xref="S3.E5.m2.1.1.1.1.3.2.cmml"><mfrac id="S3.E5.m2.1.1.1.1.3.2a" xref="S3.E5.m2.1.1.1.1.3.2.cmml"><mn id="S3.E5.m2.1.1.1.1.3.2.2" mathsize="90%" xref="S3.E5.m2.1.1.1.1.3.2.2.cmml">1</mn><mi id="S3.E5.m2.1.1.1.1.3.2.3" mathsize="90%" xref="S3.E5.m2.1.1.1.1.3.2.3.cmml">N</mi></mfrac></mstyle><mo id="S3.E5.m2.1.1.1.1.3.1" xref="S3.E5.m2.1.1.1.1.3.1.cmml">⁢</mo><mrow id="S3.E5.m2.1.1.1.1.3.3" xref="S3.E5.m2.1.1.1.1.3.3.cmml"><mstyle displaystyle="true" id="S3.E5.m2.1.1.1.1.3.3.1" xref="S3.E5.m2.1.1.1.1.3.3.1.cmml"><munderover id="S3.E5.m2.1.1.1.1.3.3.1a" xref="S3.E5.m2.1.1.1.1.3.3.1.cmml"><mo id="S3.E5.m2.1.1.1.1.3.3.1.2.2" maxsize="90%" minsize="90%" movablelimits="false" stretchy="true" xref="S3.E5.m2.1.1.1.1.3.3.1.2.2.cmml">∑</mo><mrow id="S3.E5.m2.1.1.1.1.3.3.1.2.3" xref="S3.E5.m2.1.1.1.1.3.3.1.2.3.cmml"><mi id="S3.E5.m2.1.1.1.1.3.3.1.2.3.2" mathsize="90%" xref="S3.E5.m2.1.1.1.1.3.3.1.2.3.2.cmml">n</mi><mo id="S3.E5.m2.1.1.1.1.3.3.1.2.3.1" mathsize="90%" xref="S3.E5.m2.1.1.1.1.3.3.1.2.3.1.cmml">=</mo><mn id="S3.E5.m2.1.1.1.1.3.3.1.2.3.3" mathsize="90%" xref="S3.E5.m2.1.1.1.1.3.3.1.2.3.3.cmml">1</mn></mrow><mi id="S3.E5.m2.1.1.1.1.3.3.1.3" mathsize="90%" xref="S3.E5.m2.1.1.1.1.3.3.1.3.cmml">N</mi></munderover></mstyle><msubsup id="S3.E5.m2.1.1.1.1.3.3.2" xref="S3.E5.m2.1.1.1.1.3.3.2.cmml"><mi id="S3.E5.m2.1.1.1.1.3.3.2.2.2" mathsize="90%" xref="S3.E5.m2.1.1.1.1.3.3.2.2.2.cmml">𝑯</mi><mrow id="S3.E5.m2.1.1.1.1.3.3.2.2.3" xref="S3.E5.m2.1.1.1.1.3.3.2.2.3.cmml"><mi id="S3.E5.m2.1.1.1.1.3.3.2.2.3.2" mathsize="90%" xref="S3.E5.m2.1.1.1.1.3.3.2.2.3.2.cmml">l</mi><mo id="S3.E5.m2.1.1.1.1.3.3.2.2.3.1" xref="S3.E5.m2.1.1.1.1.3.3.2.2.3.1.cmml">⁢</mo><mi id="S3.E5.m2.1.1.1.1.3.3.2.2.3.3" mathsize="90%" xref="S3.E5.m2.1.1.1.1.3.3.2.2.3.3.cmml">e</mi><mo id="S3.E5.m2.1.1.1.1.3.3.2.2.3.1a" xref="S3.E5.m2.1.1.1.1.3.3.2.2.3.1.cmml">⁢</mo><mi id="S3.E5.m2.1.1.1.1.3.3.2.2.3.4" mathsize="90%" xref="S3.E5.m2.1.1.1.1.3.3.2.2.3.4.cmml">f</mi><mo id="S3.E5.m2.1.1.1.1.3.3.2.2.3.1b" xref="S3.E5.m2.1.1.1.1.3.3.2.2.3.1.cmml">⁢</mo><mi id="S3.E5.m2.1.1.1.1.3.3.2.2.3.5" mathsize="90%" xref="S3.E5.m2.1.1.1.1.3.3.2.2.3.5.cmml">t</mi></mrow><mi id="S3.E5.m2.1.1.1.1.3.3.2.3" mathsize="90%" xref="S3.E5.m2.1.1.1.1.3.3.2.3.cmml">n</mi></msubsup></mrow></mrow></mrow><mo id="S3.E5.m2.1.1.1.2" mathsize="90%" xref="S3.E5.m2.1.1.1.1.cmml">,</mo></mrow><annotation-xml encoding="MathML-Content" id="S3.E5.m2.1b"><apply id="S3.E5.m2.1.1.1.1.cmml" xref="S3.E5.m2.1.1.1"><eq id="S3.E5.m2.1.1.1.1.1.cmml" xref="S3.E5.m2.1.1.1.1.1"></eq><csymbol cd="latexml" id="S3.E5.m2.1.1.1.1.2.cmml" xref="S3.E5.m2.1.1.1.1.2">absent</csymbol><apply id="S3.E5.m2.1.1.1.1.3.cmml" xref="S3.E5.m2.1.1.1.1.3"><times id="S3.E5.m2.1.1.1.1.3.1.cmml" xref="S3.E5.m2.1.1.1.1.3.1"></times><apply id="S3.E5.m2.1.1.1.1.3.2.cmml" xref="S3.E5.m2.1.1.1.1.3.2"><divide id="S3.E5.m2.1.1.1.1.3.2.1.cmml" xref="S3.E5.m2.1.1.1.1.3.2"></divide><cn id="S3.E5.m2.1.1.1.1.3.2.2.cmml" type="integer" xref="S3.E5.m2.1.1.1.1.3.2.2">1</cn><ci id="S3.E5.m2.1.1.1.1.3.2.3.cmml" xref="S3.E5.m2.1.1.1.1.3.2.3">𝑁</ci></apply><apply id="S3.E5.m2.1.1.1.1.3.3.cmml" xref="S3.E5.m2.1.1.1.1.3.3"><apply id="S3.E5.m2.1.1.1.1.3.3.1.cmml" xref="S3.E5.m2.1.1.1.1.3.3.1"><csymbol cd="ambiguous" id="S3.E5.m2.1.1.1.1.3.3.1.1.cmml" xref="S3.E5.m2.1.1.1.1.3.3.1">superscript</csymbol><apply id="S3.E5.m2.1.1.1.1.3.3.1.2.cmml" xref="S3.E5.m2.1.1.1.1.3.3.1"><csymbol cd="ambiguous" id="S3.E5.m2.1.1.1.1.3.3.1.2.1.cmml" xref="S3.E5.m2.1.1.1.1.3.3.1">subscript</csymbol><sum id="S3.E5.m2.1.1.1.1.3.3.1.2.2.cmml" xref="S3.E5.m2.1.1.1.1.3.3.1.2.2"></sum><apply id="S3.E5.m2.1.1.1.1.3.3.1.2.3.cmml" xref="S3.E5.m2.1.1.1.1.3.3.1.2.3"><eq id="S3.E5.m2.1.1.1.1.3.3.1.2.3.1.cmml" xref="S3.E5.m2.1.1.1.1.3.3.1.2.3.1"></eq><ci id="S3.E5.m2.1.1.1.1.3.3.1.2.3.2.cmml" xref="S3.E5.m2.1.1.1.1.3.3.1.2.3.2">𝑛</ci><cn id="S3.E5.m2.1.1.1.1.3.3.1.2.3.3.cmml" type="integer" xref="S3.E5.m2.1.1.1.1.3.3.1.2.3.3">1</cn></apply></apply><ci id="S3.E5.m2.1.1.1.1.3.3.1.3.cmml" xref="S3.E5.m2.1.1.1.1.3.3.1.3">𝑁</ci></apply><apply id="S3.E5.m2.1.1.1.1.3.3.2.cmml" xref="S3.E5.m2.1.1.1.1.3.3.2"><csymbol cd="ambiguous" id="S3.E5.m2.1.1.1.1.3.3.2.1.cmml" xref="S3.E5.m2.1.1.1.1.3.3.2">superscript</csymbol><apply id="S3.E5.m2.1.1.1.1.3.3.2.2.cmml" xref="S3.E5.m2.1.1.1.1.3.3.2"><csymbol cd="ambiguous" id="S3.E5.m2.1.1.1.1.3.3.2.2.1.cmml" xref="S3.E5.m2.1.1.1.1.3.3.2">subscript</csymbol><ci id="S3.E5.m2.1.1.1.1.3.3.2.2.2.cmml" xref="S3.E5.m2.1.1.1.1.3.3.2.2.2">𝑯</ci><apply id="S3.E5.m2.1.1.1.1.3.3.2.2.3.cmml" xref="S3.E5.m2.1.1.1.1.3.3.2.2.3"><times id="S3.E5.m2.1.1.1.1.3.3.2.2.3.1.cmml" xref="S3.E5.m2.1.1.1.1.3.3.2.2.3.1"></times><ci id="S3.E5.m2.1.1.1.1.3.3.2.2.3.2.cmml" xref="S3.E5.m2.1.1.1.1.3.3.2.2.3.2">𝑙</ci><ci id="S3.E5.m2.1.1.1.1.3.3.2.2.3.3.cmml" xref="S3.E5.m2.1.1.1.1.3.3.2.2.3.3">𝑒</ci><ci id="S3.E5.m2.1.1.1.1.3.3.2.2.3.4.cmml" xref="S3.E5.m2.1.1.1.1.3.3.2.2.3.4">𝑓</ci><ci id="S3.E5.m2.1.1.1.1.3.3.2.2.3.5.cmml" xref="S3.E5.m2.1.1.1.1.3.3.2.2.3.5">𝑡</ci></apply></apply><ci id="S3.E5.m2.1.1.1.1.3.3.2.3.cmml" xref="S3.E5.m2.1.1.1.1.3.3.2.3">𝑛</ci></apply></apply></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E5.m2.1c">\displaystyle=\frac{1}{N}\sum_{n=1}^{N}\boldsymbol{H}_{left}^{n},</annotation><annotation encoding="application/x-llamapun" id="S3.E5.m2.1d">= divide start_ARG 1 end_ARG start_ARG italic_N end_ARG ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT bold_italic_H start_POSTSUBSCRIPT italic_l italic_e italic_f italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ,</annotation></semantics></math></td> <td class="ltx_eqn_cell ltx_eqn_center_padright"></td> <td class="ltx_eqn_cell ltx_eqn_eqno ltx_align_middle ltx_align_right" rowspan="1"><span class="ltx_tag ltx_tag_equation ltx_align_right">(5)</span></td> </tr></tbody> <tbody id="S3.E6"><tr class="ltx_equation ltx_eqn_row ltx_align_baseline"> <td class="ltx_eqn_cell ltx_eqn_center_padleft"></td> <td class="ltx_td ltx_align_right ltx_eqn_cell"><math alttext="\displaystyle\boldsymbol{H}_{asv}" class="ltx_Math" display="inline" id="S3.E6.m1.1"><semantics id="S3.E6.m1.1a"><msub id="S3.E6.m1.1.1" xref="S3.E6.m1.1.1.cmml"><mi id="S3.E6.m1.1.1.2" mathsize="90%" xref="S3.E6.m1.1.1.2.cmml">𝑯</mi><mrow id="S3.E6.m1.1.1.3" xref="S3.E6.m1.1.1.3.cmml"><mi id="S3.E6.m1.1.1.3.2" mathsize="90%" xref="S3.E6.m1.1.1.3.2.cmml">a</mi><mo id="S3.E6.m1.1.1.3.1" xref="S3.E6.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.E6.m1.1.1.3.3" mathsize="90%" xref="S3.E6.m1.1.1.3.3.cmml">s</mi><mo id="S3.E6.m1.1.1.3.1a" xref="S3.E6.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.E6.m1.1.1.3.4" mathsize="90%" xref="S3.E6.m1.1.1.3.4.cmml">v</mi></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.E6.m1.1b"><apply id="S3.E6.m1.1.1.cmml" xref="S3.E6.m1.1.1"><csymbol cd="ambiguous" id="S3.E6.m1.1.1.1.cmml" xref="S3.E6.m1.1.1">subscript</csymbol><ci id="S3.E6.m1.1.1.2.cmml" xref="S3.E6.m1.1.1.2">𝑯</ci><apply id="S3.E6.m1.1.1.3.cmml" xref="S3.E6.m1.1.1.3"><times id="S3.E6.m1.1.1.3.1.cmml" xref="S3.E6.m1.1.1.3.1"></times><ci id="S3.E6.m1.1.1.3.2.cmml" xref="S3.E6.m1.1.1.3.2">𝑎</ci><ci id="S3.E6.m1.1.1.3.3.cmml" xref="S3.E6.m1.1.1.3.3">𝑠</ci><ci id="S3.E6.m1.1.1.3.4.cmml" xref="S3.E6.m1.1.1.3.4">𝑣</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E6.m1.1c">\displaystyle\boldsymbol{H}_{asv}</annotation><annotation encoding="application/x-llamapun" id="S3.E6.m1.1d">bold_italic_H start_POSTSUBSCRIPT italic_a italic_s italic_v end_POSTSUBSCRIPT</annotation></semantics></math></td> <td class="ltx_td ltx_align_left ltx_eqn_cell"><math alttext="\displaystyle=\frac{1}{N}\sum_{n=1}^{N}\boldsymbol{H}_{right}^{n}," class="ltx_Math" display="inline" id="S3.E6.m2.1"><semantics id="S3.E6.m2.1a"><mrow id="S3.E6.m2.1.1.1" xref="S3.E6.m2.1.1.1.1.cmml"><mrow id="S3.E6.m2.1.1.1.1" xref="S3.E6.m2.1.1.1.1.cmml"><mi id="S3.E6.m2.1.1.1.1.2" xref="S3.E6.m2.1.1.1.1.2.cmml"></mi><mo id="S3.E6.m2.1.1.1.1.1" mathsize="90%" xref="S3.E6.m2.1.1.1.1.1.cmml">=</mo><mrow id="S3.E6.m2.1.1.1.1.3" xref="S3.E6.m2.1.1.1.1.3.cmml"><mstyle displaystyle="true" id="S3.E6.m2.1.1.1.1.3.2" xref="S3.E6.m2.1.1.1.1.3.2.cmml"><mfrac id="S3.E6.m2.1.1.1.1.3.2a" xref="S3.E6.m2.1.1.1.1.3.2.cmml"><mn id="S3.E6.m2.1.1.1.1.3.2.2" mathsize="90%" xref="S3.E6.m2.1.1.1.1.3.2.2.cmml">1</mn><mi id="S3.E6.m2.1.1.1.1.3.2.3" mathsize="90%" xref="S3.E6.m2.1.1.1.1.3.2.3.cmml">N</mi></mfrac></mstyle><mo id="S3.E6.m2.1.1.1.1.3.1" xref="S3.E6.m2.1.1.1.1.3.1.cmml">⁢</mo><mrow id="S3.E6.m2.1.1.1.1.3.3" xref="S3.E6.m2.1.1.1.1.3.3.cmml"><mstyle displaystyle="true" id="S3.E6.m2.1.1.1.1.3.3.1" xref="S3.E6.m2.1.1.1.1.3.3.1.cmml"><munderover id="S3.E6.m2.1.1.1.1.3.3.1a" xref="S3.E6.m2.1.1.1.1.3.3.1.cmml"><mo id="S3.E6.m2.1.1.1.1.3.3.1.2.2" maxsize="90%" minsize="90%" movablelimits="false" stretchy="true" xref="S3.E6.m2.1.1.1.1.3.3.1.2.2.cmml">∑</mo><mrow id="S3.E6.m2.1.1.1.1.3.3.1.2.3" xref="S3.E6.m2.1.1.1.1.3.3.1.2.3.cmml"><mi id="S3.E6.m2.1.1.1.1.3.3.1.2.3.2" mathsize="90%" xref="S3.E6.m2.1.1.1.1.3.3.1.2.3.2.cmml">n</mi><mo id="S3.E6.m2.1.1.1.1.3.3.1.2.3.1" mathsize="90%" xref="S3.E6.m2.1.1.1.1.3.3.1.2.3.1.cmml">=</mo><mn id="S3.E6.m2.1.1.1.1.3.3.1.2.3.3" mathsize="90%" xref="S3.E6.m2.1.1.1.1.3.3.1.2.3.3.cmml">1</mn></mrow><mi id="S3.E6.m2.1.1.1.1.3.3.1.3" mathsize="90%" xref="S3.E6.m2.1.1.1.1.3.3.1.3.cmml">N</mi></munderover></mstyle><msubsup id="S3.E6.m2.1.1.1.1.3.3.2" xref="S3.E6.m2.1.1.1.1.3.3.2.cmml"><mi id="S3.E6.m2.1.1.1.1.3.3.2.2.2" mathsize="90%" xref="S3.E6.m2.1.1.1.1.3.3.2.2.2.cmml">𝑯</mi><mrow id="S3.E6.m2.1.1.1.1.3.3.2.2.3" xref="S3.E6.m2.1.1.1.1.3.3.2.2.3.cmml"><mi id="S3.E6.m2.1.1.1.1.3.3.2.2.3.2" mathsize="90%" xref="S3.E6.m2.1.1.1.1.3.3.2.2.3.2.cmml">r</mi><mo id="S3.E6.m2.1.1.1.1.3.3.2.2.3.1" xref="S3.E6.m2.1.1.1.1.3.3.2.2.3.1.cmml">⁢</mo><mi id="S3.E6.m2.1.1.1.1.3.3.2.2.3.3" mathsize="90%" xref="S3.E6.m2.1.1.1.1.3.3.2.2.3.3.cmml">i</mi><mo id="S3.E6.m2.1.1.1.1.3.3.2.2.3.1a" xref="S3.E6.m2.1.1.1.1.3.3.2.2.3.1.cmml">⁢</mo><mi id="S3.E6.m2.1.1.1.1.3.3.2.2.3.4" mathsize="90%" xref="S3.E6.m2.1.1.1.1.3.3.2.2.3.4.cmml">g</mi><mo id="S3.E6.m2.1.1.1.1.3.3.2.2.3.1b" xref="S3.E6.m2.1.1.1.1.3.3.2.2.3.1.cmml">⁢</mo><mi id="S3.E6.m2.1.1.1.1.3.3.2.2.3.5" mathsize="90%" xref="S3.E6.m2.1.1.1.1.3.3.2.2.3.5.cmml">h</mi><mo id="S3.E6.m2.1.1.1.1.3.3.2.2.3.1c" xref="S3.E6.m2.1.1.1.1.3.3.2.2.3.1.cmml">⁢</mo><mi id="S3.E6.m2.1.1.1.1.3.3.2.2.3.6" mathsize="90%" xref="S3.E6.m2.1.1.1.1.3.3.2.2.3.6.cmml">t</mi></mrow><mi id="S3.E6.m2.1.1.1.1.3.3.2.3" mathsize="90%" xref="S3.E6.m2.1.1.1.1.3.3.2.3.cmml">n</mi></msubsup></mrow></mrow></mrow><mo id="S3.E6.m2.1.1.1.2" mathsize="90%" xref="S3.E6.m2.1.1.1.1.cmml">,</mo></mrow><annotation-xml encoding="MathML-Content" id="S3.E6.m2.1b"><apply id="S3.E6.m2.1.1.1.1.cmml" xref="S3.E6.m2.1.1.1"><eq id="S3.E6.m2.1.1.1.1.1.cmml" xref="S3.E6.m2.1.1.1.1.1"></eq><csymbol cd="latexml" id="S3.E6.m2.1.1.1.1.2.cmml" xref="S3.E6.m2.1.1.1.1.2">absent</csymbol><apply id="S3.E6.m2.1.1.1.1.3.cmml" xref="S3.E6.m2.1.1.1.1.3"><times id="S3.E6.m2.1.1.1.1.3.1.cmml" xref="S3.E6.m2.1.1.1.1.3.1"></times><apply id="S3.E6.m2.1.1.1.1.3.2.cmml" xref="S3.E6.m2.1.1.1.1.3.2"><divide id="S3.E6.m2.1.1.1.1.3.2.1.cmml" xref="S3.E6.m2.1.1.1.1.3.2"></divide><cn id="S3.E6.m2.1.1.1.1.3.2.2.cmml" type="integer" xref="S3.E6.m2.1.1.1.1.3.2.2">1</cn><ci id="S3.E6.m2.1.1.1.1.3.2.3.cmml" xref="S3.E6.m2.1.1.1.1.3.2.3">𝑁</ci></apply><apply id="S3.E6.m2.1.1.1.1.3.3.cmml" xref="S3.E6.m2.1.1.1.1.3.3"><apply id="S3.E6.m2.1.1.1.1.3.3.1.cmml" xref="S3.E6.m2.1.1.1.1.3.3.1"><csymbol cd="ambiguous" id="S3.E6.m2.1.1.1.1.3.3.1.1.cmml" xref="S3.E6.m2.1.1.1.1.3.3.1">superscript</csymbol><apply id="S3.E6.m2.1.1.1.1.3.3.1.2.cmml" xref="S3.E6.m2.1.1.1.1.3.3.1"><csymbol cd="ambiguous" id="S3.E6.m2.1.1.1.1.3.3.1.2.1.cmml" xref="S3.E6.m2.1.1.1.1.3.3.1">subscript</csymbol><sum id="S3.E6.m2.1.1.1.1.3.3.1.2.2.cmml" xref="S3.E6.m2.1.1.1.1.3.3.1.2.2"></sum><apply id="S3.E6.m2.1.1.1.1.3.3.1.2.3.cmml" xref="S3.E6.m2.1.1.1.1.3.3.1.2.3"><eq id="S3.E6.m2.1.1.1.1.3.3.1.2.3.1.cmml" xref="S3.E6.m2.1.1.1.1.3.3.1.2.3.1"></eq><ci id="S3.E6.m2.1.1.1.1.3.3.1.2.3.2.cmml" xref="S3.E6.m2.1.1.1.1.3.3.1.2.3.2">𝑛</ci><cn id="S3.E6.m2.1.1.1.1.3.3.1.2.3.3.cmml" type="integer" xref="S3.E6.m2.1.1.1.1.3.3.1.2.3.3">1</cn></apply></apply><ci id="S3.E6.m2.1.1.1.1.3.3.1.3.cmml" xref="S3.E6.m2.1.1.1.1.3.3.1.3">𝑁</ci></apply><apply id="S3.E6.m2.1.1.1.1.3.3.2.cmml" xref="S3.E6.m2.1.1.1.1.3.3.2"><csymbol cd="ambiguous" id="S3.E6.m2.1.1.1.1.3.3.2.1.cmml" xref="S3.E6.m2.1.1.1.1.3.3.2">superscript</csymbol><apply id="S3.E6.m2.1.1.1.1.3.3.2.2.cmml" xref="S3.E6.m2.1.1.1.1.3.3.2"><csymbol cd="ambiguous" id="S3.E6.m2.1.1.1.1.3.3.2.2.1.cmml" xref="S3.E6.m2.1.1.1.1.3.3.2">subscript</csymbol><ci id="S3.E6.m2.1.1.1.1.3.3.2.2.2.cmml" xref="S3.E6.m2.1.1.1.1.3.3.2.2.2">𝑯</ci><apply id="S3.E6.m2.1.1.1.1.3.3.2.2.3.cmml" xref="S3.E6.m2.1.1.1.1.3.3.2.2.3"><times id="S3.E6.m2.1.1.1.1.3.3.2.2.3.1.cmml" xref="S3.E6.m2.1.1.1.1.3.3.2.2.3.1"></times><ci id="S3.E6.m2.1.1.1.1.3.3.2.2.3.2.cmml" xref="S3.E6.m2.1.1.1.1.3.3.2.2.3.2">𝑟</ci><ci id="S3.E6.m2.1.1.1.1.3.3.2.2.3.3.cmml" xref="S3.E6.m2.1.1.1.1.3.3.2.2.3.3">𝑖</ci><ci id="S3.E6.m2.1.1.1.1.3.3.2.2.3.4.cmml" xref="S3.E6.m2.1.1.1.1.3.3.2.2.3.4">𝑔</ci><ci id="S3.E6.m2.1.1.1.1.3.3.2.2.3.5.cmml" xref="S3.E6.m2.1.1.1.1.3.3.2.2.3.5">ℎ</ci><ci id="S3.E6.m2.1.1.1.1.3.3.2.2.3.6.cmml" xref="S3.E6.m2.1.1.1.1.3.3.2.2.3.6">𝑡</ci></apply></apply><ci id="S3.E6.m2.1.1.1.1.3.3.2.3.cmml" xref="S3.E6.m2.1.1.1.1.3.3.2.3">𝑛</ci></apply></apply></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E6.m2.1c">\displaystyle=\frac{1}{N}\sum_{n=1}^{N}\boldsymbol{H}_{right}^{n},</annotation><annotation encoding="application/x-llamapun" id="S3.E6.m2.1d">= divide start_ARG 1 end_ARG start_ARG italic_N end_ARG ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT bold_italic_H start_POSTSUBSCRIPT italic_r italic_i italic_g italic_h italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ,</annotation></semantics></math></td> <td class="ltx_eqn_cell ltx_eqn_center_padright"></td> <td class="ltx_eqn_cell ltx_eqn_eqno ltx_align_middle ltx_align_right" rowspan="1"><span class="ltx_tag ltx_tag_equation ltx_align_right">(6)</span></td> </tr></tbody> </table> <p class="ltx_p" id="S3.SS2.SSS2.p1.2"><span class="ltx_text" id="S3.SS2.SSS2.p1.2.1" style="font-size:90%;">Here, </span><math alttext="\boldsymbol{H}_{asv}" class="ltx_Math" display="inline" id="S3.SS2.SSS2.p1.1.m1.1"><semantics id="S3.SS2.SSS2.p1.1.m1.1a"><msub id="S3.SS2.SSS2.p1.1.m1.1.1" xref="S3.SS2.SSS2.p1.1.m1.1.1.cmml"><mi id="S3.SS2.SSS2.p1.1.m1.1.1.2" mathsize="90%" xref="S3.SS2.SSS2.p1.1.m1.1.1.2.cmml">𝑯</mi><mrow id="S3.SS2.SSS2.p1.1.m1.1.1.3" xref="S3.SS2.SSS2.p1.1.m1.1.1.3.cmml"><mi id="S3.SS2.SSS2.p1.1.m1.1.1.3.2" mathsize="90%" xref="S3.SS2.SSS2.p1.1.m1.1.1.3.2.cmml">a</mi><mo id="S3.SS2.SSS2.p1.1.m1.1.1.3.1" xref="S3.SS2.SSS2.p1.1.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p1.1.m1.1.1.3.3" mathsize="90%" xref="S3.SS2.SSS2.p1.1.m1.1.1.3.3.cmml">s</mi><mo id="S3.SS2.SSS2.p1.1.m1.1.1.3.1a" xref="S3.SS2.SSS2.p1.1.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p1.1.m1.1.1.3.4" mathsize="90%" xref="S3.SS2.SSS2.p1.1.m1.1.1.3.4.cmml">v</mi></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS2.p1.1.m1.1b"><apply id="S3.SS2.SSS2.p1.1.m1.1.1.cmml" xref="S3.SS2.SSS2.p1.1.m1.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS2.p1.1.m1.1.1.1.cmml" xref="S3.SS2.SSS2.p1.1.m1.1.1">subscript</csymbol><ci id="S3.SS2.SSS2.p1.1.m1.1.1.2.cmml" xref="S3.SS2.SSS2.p1.1.m1.1.1.2">𝑯</ci><apply id="S3.SS2.SSS2.p1.1.m1.1.1.3.cmml" xref="S3.SS2.SSS2.p1.1.m1.1.1.3"><times id="S3.SS2.SSS2.p1.1.m1.1.1.3.1.cmml" xref="S3.SS2.SSS2.p1.1.m1.1.1.3.1"></times><ci id="S3.SS2.SSS2.p1.1.m1.1.1.3.2.cmml" xref="S3.SS2.SSS2.p1.1.m1.1.1.3.2">𝑎</ci><ci id="S3.SS2.SSS2.p1.1.m1.1.1.3.3.cmml" xref="S3.SS2.SSS2.p1.1.m1.1.1.3.3">𝑠</ci><ci id="S3.SS2.SSS2.p1.1.m1.1.1.3.4.cmml" xref="S3.SS2.SSS2.p1.1.m1.1.1.3.4">𝑣</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS2.p1.1.m1.1c">\boldsymbol{H}_{asv}</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS2.p1.1.m1.1d">bold_italic_H start_POSTSUBSCRIPT italic_a italic_s italic_v end_POSTSUBSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS2.p1.2.2" style="font-size:90%;"> and </span><math alttext="\boldsymbol{H}_{spoof}" class="ltx_Math" display="inline" id="S3.SS2.SSS2.p1.2.m2.1"><semantics id="S3.SS2.SSS2.p1.2.m2.1a"><msub id="S3.SS2.SSS2.p1.2.m2.1.1" xref="S3.SS2.SSS2.p1.2.m2.1.1.cmml"><mi id="S3.SS2.SSS2.p1.2.m2.1.1.2" mathsize="90%" xref="S3.SS2.SSS2.p1.2.m2.1.1.2.cmml">𝑯</mi><mrow id="S3.SS2.SSS2.p1.2.m2.1.1.3" xref="S3.SS2.SSS2.p1.2.m2.1.1.3.cmml"><mi id="S3.SS2.SSS2.p1.2.m2.1.1.3.2" mathsize="90%" xref="S3.SS2.SSS2.p1.2.m2.1.1.3.2.cmml">s</mi><mo id="S3.SS2.SSS2.p1.2.m2.1.1.3.1" xref="S3.SS2.SSS2.p1.2.m2.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p1.2.m2.1.1.3.3" mathsize="90%" xref="S3.SS2.SSS2.p1.2.m2.1.1.3.3.cmml">p</mi><mo id="S3.SS2.SSS2.p1.2.m2.1.1.3.1a" xref="S3.SS2.SSS2.p1.2.m2.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p1.2.m2.1.1.3.4" mathsize="90%" xref="S3.SS2.SSS2.p1.2.m2.1.1.3.4.cmml">o</mi><mo id="S3.SS2.SSS2.p1.2.m2.1.1.3.1b" xref="S3.SS2.SSS2.p1.2.m2.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p1.2.m2.1.1.3.5" mathsize="90%" xref="S3.SS2.SSS2.p1.2.m2.1.1.3.5.cmml">o</mi><mo id="S3.SS2.SSS2.p1.2.m2.1.1.3.1c" xref="S3.SS2.SSS2.p1.2.m2.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p1.2.m2.1.1.3.6" mathsize="90%" xref="S3.SS2.SSS2.p1.2.m2.1.1.3.6.cmml">f</mi></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS2.p1.2.m2.1b"><apply id="S3.SS2.SSS2.p1.2.m2.1.1.cmml" xref="S3.SS2.SSS2.p1.2.m2.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS2.p1.2.m2.1.1.1.cmml" xref="S3.SS2.SSS2.p1.2.m2.1.1">subscript</csymbol><ci id="S3.SS2.SSS2.p1.2.m2.1.1.2.cmml" xref="S3.SS2.SSS2.p1.2.m2.1.1.2">𝑯</ci><apply id="S3.SS2.SSS2.p1.2.m2.1.1.3.cmml" xref="S3.SS2.SSS2.p1.2.m2.1.1.3"><times id="S3.SS2.SSS2.p1.2.m2.1.1.3.1.cmml" xref="S3.SS2.SSS2.p1.2.m2.1.1.3.1"></times><ci id="S3.SS2.SSS2.p1.2.m2.1.1.3.2.cmml" xref="S3.SS2.SSS2.p1.2.m2.1.1.3.2">𝑠</ci><ci id="S3.SS2.SSS2.p1.2.m2.1.1.3.3.cmml" xref="S3.SS2.SSS2.p1.2.m2.1.1.3.3">𝑝</ci><ci id="S3.SS2.SSS2.p1.2.m2.1.1.3.4.cmml" xref="S3.SS2.SSS2.p1.2.m2.1.1.3.4">𝑜</ci><ci id="S3.SS2.SSS2.p1.2.m2.1.1.3.5.cmml" xref="S3.SS2.SSS2.p1.2.m2.1.1.3.5">𝑜</ci><ci id="S3.SS2.SSS2.p1.2.m2.1.1.3.6.cmml" xref="S3.SS2.SSS2.p1.2.m2.1.1.3.6">𝑓</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS2.p1.2.m2.1c">\boldsymbol{H}_{spoof}</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS2.p1.2.m2.1d">bold_italic_H start_POSTSUBSCRIPT italic_s italic_p italic_o italic_o italic_f end_POSTSUBSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS2.p1.2.3" style="font-size:90%;"> represent the aggregated output for the ASV and anti-spoofing tasks, respectively.</span></p> </div> <div class="ltx_para" id="S3.SS2.SSS2.p2"> <p class="ltx_p" id="S3.SS2.SSS2.p2.2"><span class="ltx_text" id="S3.SS2.SSS2.p2.2.1" style="font-size:90%;">Both of the aggregated features undergo transformation through a one-dimensional convolutional layer followed by an average pooling layer. Subsequently, the classification probabilities are computed based on the outputs </span><math alttext="\boldsymbol{e}_{asv}" class="ltx_Math" display="inline" id="S3.SS2.SSS2.p2.1.m1.1"><semantics id="S3.SS2.SSS2.p2.1.m1.1a"><msub id="S3.SS2.SSS2.p2.1.m1.1.1" xref="S3.SS2.SSS2.p2.1.m1.1.1.cmml"><mi id="S3.SS2.SSS2.p2.1.m1.1.1.2" mathsize="90%" xref="S3.SS2.SSS2.p2.1.m1.1.1.2.cmml">𝒆</mi><mrow id="S3.SS2.SSS2.p2.1.m1.1.1.3" xref="S3.SS2.SSS2.p2.1.m1.1.1.3.cmml"><mi id="S3.SS2.SSS2.p2.1.m1.1.1.3.2" mathsize="90%" xref="S3.SS2.SSS2.p2.1.m1.1.1.3.2.cmml">a</mi><mo id="S3.SS2.SSS2.p2.1.m1.1.1.3.1" xref="S3.SS2.SSS2.p2.1.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.1.m1.1.1.3.3" mathsize="90%" xref="S3.SS2.SSS2.p2.1.m1.1.1.3.3.cmml">s</mi><mo id="S3.SS2.SSS2.p2.1.m1.1.1.3.1a" xref="S3.SS2.SSS2.p2.1.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.1.m1.1.1.3.4" mathsize="90%" xref="S3.SS2.SSS2.p2.1.m1.1.1.3.4.cmml">v</mi></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS2.p2.1.m1.1b"><apply id="S3.SS2.SSS2.p2.1.m1.1.1.cmml" xref="S3.SS2.SSS2.p2.1.m1.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS2.p2.1.m1.1.1.1.cmml" xref="S3.SS2.SSS2.p2.1.m1.1.1">subscript</csymbol><ci id="S3.SS2.SSS2.p2.1.m1.1.1.2.cmml" xref="S3.SS2.SSS2.p2.1.m1.1.1.2">𝒆</ci><apply id="S3.SS2.SSS2.p2.1.m1.1.1.3.cmml" xref="S3.SS2.SSS2.p2.1.m1.1.1.3"><times id="S3.SS2.SSS2.p2.1.m1.1.1.3.1.cmml" xref="S3.SS2.SSS2.p2.1.m1.1.1.3.1"></times><ci id="S3.SS2.SSS2.p2.1.m1.1.1.3.2.cmml" xref="S3.SS2.SSS2.p2.1.m1.1.1.3.2">𝑎</ci><ci id="S3.SS2.SSS2.p2.1.m1.1.1.3.3.cmml" xref="S3.SS2.SSS2.p2.1.m1.1.1.3.3">𝑠</ci><ci id="S3.SS2.SSS2.p2.1.m1.1.1.3.4.cmml" xref="S3.SS2.SSS2.p2.1.m1.1.1.3.4">𝑣</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS2.p2.1.m1.1c">\boldsymbol{e}_{asv}</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS2.p2.1.m1.1d">bold_italic_e start_POSTSUBSCRIPT italic_a italic_s italic_v end_POSTSUBSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS2.p2.2.2" style="font-size:90%;"> and </span><math alttext="\boldsymbol{e}_{spoof}" class="ltx_Math" display="inline" id="S3.SS2.SSS2.p2.2.m2.1"><semantics id="S3.SS2.SSS2.p2.2.m2.1a"><msub id="S3.SS2.SSS2.p2.2.m2.1.1" xref="S3.SS2.SSS2.p2.2.m2.1.1.cmml"><mi id="S3.SS2.SSS2.p2.2.m2.1.1.2" mathsize="90%" xref="S3.SS2.SSS2.p2.2.m2.1.1.2.cmml">𝒆</mi><mrow id="S3.SS2.SSS2.p2.2.m2.1.1.3" xref="S3.SS2.SSS2.p2.2.m2.1.1.3.cmml"><mi id="S3.SS2.SSS2.p2.2.m2.1.1.3.2" mathsize="90%" xref="S3.SS2.SSS2.p2.2.m2.1.1.3.2.cmml">s</mi><mo id="S3.SS2.SSS2.p2.2.m2.1.1.3.1" xref="S3.SS2.SSS2.p2.2.m2.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.2.m2.1.1.3.3" mathsize="90%" xref="S3.SS2.SSS2.p2.2.m2.1.1.3.3.cmml">p</mi><mo id="S3.SS2.SSS2.p2.2.m2.1.1.3.1a" xref="S3.SS2.SSS2.p2.2.m2.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.2.m2.1.1.3.4" mathsize="90%" xref="S3.SS2.SSS2.p2.2.m2.1.1.3.4.cmml">o</mi><mo id="S3.SS2.SSS2.p2.2.m2.1.1.3.1b" xref="S3.SS2.SSS2.p2.2.m2.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.2.m2.1.1.3.5" mathsize="90%" xref="S3.SS2.SSS2.p2.2.m2.1.1.3.5.cmml">o</mi><mo id="S3.SS2.SSS2.p2.2.m2.1.1.3.1c" xref="S3.SS2.SSS2.p2.2.m2.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.2.m2.1.1.3.6" mathsize="90%" xref="S3.SS2.SSS2.p2.2.m2.1.1.3.6.cmml">f</mi></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS2.p2.2.m2.1b"><apply id="S3.SS2.SSS2.p2.2.m2.1.1.cmml" xref="S3.SS2.SSS2.p2.2.m2.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS2.p2.2.m2.1.1.1.cmml" xref="S3.SS2.SSS2.p2.2.m2.1.1">subscript</csymbol><ci id="S3.SS2.SSS2.p2.2.m2.1.1.2.cmml" xref="S3.SS2.SSS2.p2.2.m2.1.1.2">𝒆</ci><apply id="S3.SS2.SSS2.p2.2.m2.1.1.3.cmml" xref="S3.SS2.SSS2.p2.2.m2.1.1.3"><times id="S3.SS2.SSS2.p2.2.m2.1.1.3.1.cmml" xref="S3.SS2.SSS2.p2.2.m2.1.1.3.1"></times><ci id="S3.SS2.SSS2.p2.2.m2.1.1.3.2.cmml" xref="S3.SS2.SSS2.p2.2.m2.1.1.3.2">𝑠</ci><ci id="S3.SS2.SSS2.p2.2.m2.1.1.3.3.cmml" xref="S3.SS2.SSS2.p2.2.m2.1.1.3.3">𝑝</ci><ci id="S3.SS2.SSS2.p2.2.m2.1.1.3.4.cmml" xref="S3.SS2.SSS2.p2.2.m2.1.1.3.4">𝑜</ci><ci id="S3.SS2.SSS2.p2.2.m2.1.1.3.5.cmml" xref="S3.SS2.SSS2.p2.2.m2.1.1.3.5">𝑜</ci><ci id="S3.SS2.SSS2.p2.2.m2.1.1.3.6.cmml" xref="S3.SS2.SSS2.p2.2.m2.1.1.3.6">𝑓</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS2.p2.2.m2.1c">\boldsymbol{e}_{spoof}</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS2.p2.2.m2.1d">bold_italic_e start_POSTSUBSCRIPT italic_s italic_p italic_o italic_o italic_f end_POSTSUBSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS2.p2.2.3" style="font-size:90%;"> from the pooling layers. This computation can be expressed as follows:</span></p> <table class="ltx_equationgroup ltx_eqn_align ltx_eqn_table" id="S6.EGx4"> <tbody id="S3.E7"><tr class="ltx_equation ltx_eqn_row ltx_align_baseline"> <td class="ltx_eqn_cell ltx_eqn_center_padleft"></td> <td class="ltx_td ltx_align_right ltx_eqn_cell"><math alttext="\displaystyle P(\boldsymbol{e}_{spoof})" class="ltx_Math" display="inline" id="S3.E7.m1.1"><semantics id="S3.E7.m1.1a"><mrow id="S3.E7.m1.1.1" xref="S3.E7.m1.1.1.cmml"><mi id="S3.E7.m1.1.1.3" mathsize="90%" xref="S3.E7.m1.1.1.3.cmml">P</mi><mo id="S3.E7.m1.1.1.2" xref="S3.E7.m1.1.1.2.cmml">⁢</mo><mrow id="S3.E7.m1.1.1.1.1" xref="S3.E7.m1.1.1.1.1.1.cmml"><mo id="S3.E7.m1.1.1.1.1.2" maxsize="90%" minsize="90%" xref="S3.E7.m1.1.1.1.1.1.cmml">(</mo><msub id="S3.E7.m1.1.1.1.1.1" xref="S3.E7.m1.1.1.1.1.1.cmml"><mi id="S3.E7.m1.1.1.1.1.1.2" mathsize="90%" xref="S3.E7.m1.1.1.1.1.1.2.cmml">𝒆</mi><mrow id="S3.E7.m1.1.1.1.1.1.3" xref="S3.E7.m1.1.1.1.1.1.3.cmml"><mi id="S3.E7.m1.1.1.1.1.1.3.2" mathsize="90%" xref="S3.E7.m1.1.1.1.1.1.3.2.cmml">s</mi><mo id="S3.E7.m1.1.1.1.1.1.3.1" xref="S3.E7.m1.1.1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.E7.m1.1.1.1.1.1.3.3" mathsize="90%" xref="S3.E7.m1.1.1.1.1.1.3.3.cmml">p</mi><mo id="S3.E7.m1.1.1.1.1.1.3.1a" xref="S3.E7.m1.1.1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.E7.m1.1.1.1.1.1.3.4" mathsize="90%" xref="S3.E7.m1.1.1.1.1.1.3.4.cmml">o</mi><mo id="S3.E7.m1.1.1.1.1.1.3.1b" xref="S3.E7.m1.1.1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.E7.m1.1.1.1.1.1.3.5" mathsize="90%" xref="S3.E7.m1.1.1.1.1.1.3.5.cmml">o</mi><mo id="S3.E7.m1.1.1.1.1.1.3.1c" xref="S3.E7.m1.1.1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.E7.m1.1.1.1.1.1.3.6" mathsize="90%" xref="S3.E7.m1.1.1.1.1.1.3.6.cmml">f</mi></mrow></msub><mo id="S3.E7.m1.1.1.1.1.3" maxsize="90%" minsize="90%" xref="S3.E7.m1.1.1.1.1.1.cmml">)</mo></mrow></mrow><annotation-xml encoding="MathML-Content" id="S3.E7.m1.1b"><apply id="S3.E7.m1.1.1.cmml" xref="S3.E7.m1.1.1"><times id="S3.E7.m1.1.1.2.cmml" xref="S3.E7.m1.1.1.2"></times><ci id="S3.E7.m1.1.1.3.cmml" xref="S3.E7.m1.1.1.3">𝑃</ci><apply id="S3.E7.m1.1.1.1.1.1.cmml" xref="S3.E7.m1.1.1.1.1"><csymbol cd="ambiguous" id="S3.E7.m1.1.1.1.1.1.1.cmml" xref="S3.E7.m1.1.1.1.1">subscript</csymbol><ci id="S3.E7.m1.1.1.1.1.1.2.cmml" xref="S3.E7.m1.1.1.1.1.1.2">𝒆</ci><apply id="S3.E7.m1.1.1.1.1.1.3.cmml" xref="S3.E7.m1.1.1.1.1.1.3"><times id="S3.E7.m1.1.1.1.1.1.3.1.cmml" xref="S3.E7.m1.1.1.1.1.1.3.1"></times><ci id="S3.E7.m1.1.1.1.1.1.3.2.cmml" xref="S3.E7.m1.1.1.1.1.1.3.2">𝑠</ci><ci id="S3.E7.m1.1.1.1.1.1.3.3.cmml" xref="S3.E7.m1.1.1.1.1.1.3.3">𝑝</ci><ci id="S3.E7.m1.1.1.1.1.1.3.4.cmml" xref="S3.E7.m1.1.1.1.1.1.3.4">𝑜</ci><ci id="S3.E7.m1.1.1.1.1.1.3.5.cmml" xref="S3.E7.m1.1.1.1.1.1.3.5">𝑜</ci><ci id="S3.E7.m1.1.1.1.1.1.3.6.cmml" xref="S3.E7.m1.1.1.1.1.1.3.6">𝑓</ci></apply></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E7.m1.1c">\displaystyle P(\boldsymbol{e}_{spoof})</annotation><annotation encoding="application/x-llamapun" id="S3.E7.m1.1d">italic_P ( bold_italic_e start_POSTSUBSCRIPT italic_s italic_p italic_o italic_o italic_f end_POSTSUBSCRIPT )</annotation></semantics></math></td> <td class="ltx_td ltx_align_left ltx_eqn_cell"><math alttext="\displaystyle=\frac{1}{1+\exp(-\boldsymbol{e}_{spoof}^{\top}\boldsymbol{\theta% }_{spoof})}," class="ltx_Math" display="inline" id="S3.E7.m2.3"><semantics id="S3.E7.m2.3a"><mrow id="S3.E7.m2.3.3.1" xref="S3.E7.m2.3.3.1.1.cmml"><mrow id="S3.E7.m2.3.3.1.1" xref="S3.E7.m2.3.3.1.1.cmml"><mi id="S3.E7.m2.3.3.1.1.2" xref="S3.E7.m2.3.3.1.1.2.cmml"></mi><mo id="S3.E7.m2.3.3.1.1.1" mathsize="90%" xref="S3.E7.m2.3.3.1.1.1.cmml">=</mo><mstyle displaystyle="true" id="S3.E7.m2.2.2" xref="S3.E7.m2.2.2.cmml"><mfrac id="S3.E7.m2.2.2a" xref="S3.E7.m2.2.2.cmml"><mn id="S3.E7.m2.2.2.4" mathsize="90%" xref="S3.E7.m2.2.2.4.cmml">1</mn><mrow id="S3.E7.m2.2.2.2" xref="S3.E7.m2.2.2.2.cmml"><mn id="S3.E7.m2.2.2.2.4" mathsize="90%" xref="S3.E7.m2.2.2.2.4.cmml">1</mn><mo id="S3.E7.m2.2.2.2.3" mathsize="90%" xref="S3.E7.m2.2.2.2.3.cmml">+</mo><mrow id="S3.E7.m2.2.2.2.2.1" xref="S3.E7.m2.2.2.2.2.2.cmml"><mi id="S3.E7.m2.1.1.1.1" mathsize="90%" xref="S3.E7.m2.1.1.1.1.cmml">exp</mi><mo id="S3.E7.m2.2.2.2.2.1a" xref="S3.E7.m2.2.2.2.2.2.cmml">⁡</mo><mrow id="S3.E7.m2.2.2.2.2.1.1" xref="S3.E7.m2.2.2.2.2.2.cmml"><mo id="S3.E7.m2.2.2.2.2.1.1.2" maxsize="90%" minsize="90%" xref="S3.E7.m2.2.2.2.2.2.cmml">(</mo><mrow id="S3.E7.m2.2.2.2.2.1.1.1" xref="S3.E7.m2.2.2.2.2.1.1.1.cmml"><mo id="S3.E7.m2.2.2.2.2.1.1.1a" mathsize="90%" xref="S3.E7.m2.2.2.2.2.1.1.1.cmml">−</mo><mrow id="S3.E7.m2.2.2.2.2.1.1.1.2" xref="S3.E7.m2.2.2.2.2.1.1.1.2.cmml"><msubsup id="S3.E7.m2.2.2.2.2.1.1.1.2.2" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.cmml"><mi id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.2" mathsize="90%" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.2.cmml">𝒆</mi><mrow id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.cmml"><mi id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.2" mathsize="90%" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.2.cmml">s</mi><mo id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.1" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.1.cmml">⁢</mo><mi id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.3" mathsize="90%" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.3.cmml">p</mi><mo id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.1a" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.1.cmml">⁢</mo><mi id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.4" mathsize="90%" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.4.cmml">o</mi><mo id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.1b" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.1.cmml">⁢</mo><mi id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.5" mathsize="90%" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.5.cmml">o</mi><mo id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.1c" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.1.cmml">⁢</mo><mi id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.6" mathsize="90%" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.6.cmml">f</mi></mrow><mo id="S3.E7.m2.2.2.2.2.1.1.1.2.2.3" mathsize="90%" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.3.cmml">⊤</mo></msubsup><mo id="S3.E7.m2.2.2.2.2.1.1.1.2.1" xref="S3.E7.m2.2.2.2.2.1.1.1.2.1.cmml">⁢</mo><msub id="S3.E7.m2.2.2.2.2.1.1.1.2.3" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.cmml"><mi id="S3.E7.m2.2.2.2.2.1.1.1.2.3.2" mathsize="90%" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.2.cmml">𝜽</mi><mrow id="S3.E7.m2.2.2.2.2.1.1.1.2.3.3" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.cmml"><mi id="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.2" mathsize="90%" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.2.cmml">s</mi><mo id="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.1" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.1.cmml">⁢</mo><mi id="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.3" mathsize="90%" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.3.cmml">p</mi><mo id="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.1a" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.1.cmml">⁢</mo><mi id="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.4" mathsize="90%" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.4.cmml">o</mi><mo id="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.1b" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.1.cmml">⁢</mo><mi id="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.5" mathsize="90%" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.5.cmml">o</mi><mo id="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.1c" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.1.cmml">⁢</mo><mi id="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.6" mathsize="90%" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.6.cmml">f</mi></mrow></msub></mrow></mrow><mo id="S3.E7.m2.2.2.2.2.1.1.3" maxsize="90%" minsize="90%" xref="S3.E7.m2.2.2.2.2.2.cmml">)</mo></mrow></mrow></mrow></mfrac></mstyle></mrow><mo id="S3.E7.m2.3.3.1.2" mathsize="90%" xref="S3.E7.m2.3.3.1.1.cmml">,</mo></mrow><annotation-xml encoding="MathML-Content" id="S3.E7.m2.3b"><apply id="S3.E7.m2.3.3.1.1.cmml" xref="S3.E7.m2.3.3.1"><eq id="S3.E7.m2.3.3.1.1.1.cmml" xref="S3.E7.m2.3.3.1.1.1"></eq><csymbol cd="latexml" id="S3.E7.m2.3.3.1.1.2.cmml" xref="S3.E7.m2.3.3.1.1.2">absent</csymbol><apply id="S3.E7.m2.2.2.cmml" xref="S3.E7.m2.2.2"><divide id="S3.E7.m2.2.2.3.cmml" xref="S3.E7.m2.2.2"></divide><cn id="S3.E7.m2.2.2.4.cmml" type="integer" xref="S3.E7.m2.2.2.4">1</cn><apply id="S3.E7.m2.2.2.2.cmml" xref="S3.E7.m2.2.2.2"><plus id="S3.E7.m2.2.2.2.3.cmml" xref="S3.E7.m2.2.2.2.3"></plus><cn id="S3.E7.m2.2.2.2.4.cmml" type="integer" xref="S3.E7.m2.2.2.2.4">1</cn><apply id="S3.E7.m2.2.2.2.2.2.cmml" xref="S3.E7.m2.2.2.2.2.1"><exp id="S3.E7.m2.1.1.1.1.cmml" xref="S3.E7.m2.1.1.1.1"></exp><apply id="S3.E7.m2.2.2.2.2.1.1.1.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1"><minus id="S3.E7.m2.2.2.2.2.1.1.1.1.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1"></minus><apply id="S3.E7.m2.2.2.2.2.1.1.1.2.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2"><times id="S3.E7.m2.2.2.2.2.1.1.1.2.1.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.1"></times><apply id="S3.E7.m2.2.2.2.2.1.1.1.2.2.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2"><csymbol cd="ambiguous" id="S3.E7.m2.2.2.2.2.1.1.1.2.2.1.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2">superscript</csymbol><apply id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2"><csymbol cd="ambiguous" id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.1.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2">subscript</csymbol><ci id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.2.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.2">𝒆</ci><apply id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3"><times id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.1.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.1"></times><ci id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.2.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.2">𝑠</ci><ci id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.3.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.3">𝑝</ci><ci id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.4.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.4">𝑜</ci><ci id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.5.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.5">𝑜</ci><ci id="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.6.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.2.3.6">𝑓</ci></apply></apply><csymbol cd="latexml" id="S3.E7.m2.2.2.2.2.1.1.1.2.2.3.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.2.3">top</csymbol></apply><apply id="S3.E7.m2.2.2.2.2.1.1.1.2.3.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3"><csymbol cd="ambiguous" id="S3.E7.m2.2.2.2.2.1.1.1.2.3.1.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3">subscript</csymbol><ci id="S3.E7.m2.2.2.2.2.1.1.1.2.3.2.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.2">𝜽</ci><apply id="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.3"><times id="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.1.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.1"></times><ci id="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.2.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.2">𝑠</ci><ci id="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.3.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.3">𝑝</ci><ci id="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.4.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.4">𝑜</ci><ci id="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.5.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.5">𝑜</ci><ci id="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.6.cmml" xref="S3.E7.m2.2.2.2.2.1.1.1.2.3.3.6">𝑓</ci></apply></apply></apply></apply></apply></apply></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E7.m2.3c">\displaystyle=\frac{1}{1+\exp(-\boldsymbol{e}_{spoof}^{\top}\boldsymbol{\theta% }_{spoof})},</annotation><annotation encoding="application/x-llamapun" id="S3.E7.m2.3d">= divide start_ARG 1 end_ARG start_ARG 1 + roman_exp ( - bold_italic_e start_POSTSUBSCRIPT italic_s italic_p italic_o italic_o italic_f end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_italic_θ start_POSTSUBSCRIPT italic_s italic_p italic_o italic_o italic_f end_POSTSUBSCRIPT ) end_ARG ,</annotation></semantics></math></td> <td class="ltx_eqn_cell ltx_eqn_center_padright"></td> <td class="ltx_eqn_cell ltx_eqn_eqno ltx_align_middle ltx_align_right" rowspan="1"><span class="ltx_tag ltx_tag_equation ltx_align_right">(7)</span></td> </tr></tbody> <tbody id="S3.Ex1"><tr class="ltx_equation ltx_eqn_row ltx_align_baseline"> <td class="ltx_eqn_cell ltx_eqn_center_padleft"></td> <td class="ltx_td ltx_align_right ltx_eqn_cell"><math alttext="\displaystyle\mathcal{L}_{spoof}" class="ltx_Math" display="inline" id="S3.Ex1.m1.1"><semantics id="S3.Ex1.m1.1a"><msub id="S3.Ex1.m1.1.1" xref="S3.Ex1.m1.1.1.cmml"><mi class="ltx_font_mathcaligraphic" id="S3.Ex1.m1.1.1.2" mathsize="90%" xref="S3.Ex1.m1.1.1.2.cmml">ℒ</mi><mrow id="S3.Ex1.m1.1.1.3" xref="S3.Ex1.m1.1.1.3.cmml"><mi id="S3.Ex1.m1.1.1.3.2" mathsize="90%" xref="S3.Ex1.m1.1.1.3.2.cmml">s</mi><mo id="S3.Ex1.m1.1.1.3.1" xref="S3.Ex1.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.Ex1.m1.1.1.3.3" mathsize="90%" xref="S3.Ex1.m1.1.1.3.3.cmml">p</mi><mo id="S3.Ex1.m1.1.1.3.1a" xref="S3.Ex1.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.Ex1.m1.1.1.3.4" mathsize="90%" xref="S3.Ex1.m1.1.1.3.4.cmml">o</mi><mo id="S3.Ex1.m1.1.1.3.1b" xref="S3.Ex1.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.Ex1.m1.1.1.3.5" mathsize="90%" xref="S3.Ex1.m1.1.1.3.5.cmml">o</mi><mo id="S3.Ex1.m1.1.1.3.1c" xref="S3.Ex1.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.Ex1.m1.1.1.3.6" mathsize="90%" xref="S3.Ex1.m1.1.1.3.6.cmml">f</mi></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.Ex1.m1.1b"><apply id="S3.Ex1.m1.1.1.cmml" xref="S3.Ex1.m1.1.1"><csymbol cd="ambiguous" id="S3.Ex1.m1.1.1.1.cmml" xref="S3.Ex1.m1.1.1">subscript</csymbol><ci id="S3.Ex1.m1.1.1.2.cmml" xref="S3.Ex1.m1.1.1.2">ℒ</ci><apply id="S3.Ex1.m1.1.1.3.cmml" xref="S3.Ex1.m1.1.1.3"><times id="S3.Ex1.m1.1.1.3.1.cmml" xref="S3.Ex1.m1.1.1.3.1"></times><ci id="S3.Ex1.m1.1.1.3.2.cmml" xref="S3.Ex1.m1.1.1.3.2">𝑠</ci><ci id="S3.Ex1.m1.1.1.3.3.cmml" xref="S3.Ex1.m1.1.1.3.3">𝑝</ci><ci id="S3.Ex1.m1.1.1.3.4.cmml" xref="S3.Ex1.m1.1.1.3.4">𝑜</ci><ci id="S3.Ex1.m1.1.1.3.5.cmml" xref="S3.Ex1.m1.1.1.3.5">𝑜</ci><ci id="S3.Ex1.m1.1.1.3.6.cmml" xref="S3.Ex1.m1.1.1.3.6">𝑓</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.Ex1.m1.1c">\displaystyle\mathcal{L}_{spoof}</annotation><annotation encoding="application/x-llamapun" id="S3.Ex1.m1.1d">caligraphic_L start_POSTSUBSCRIPT italic_s italic_p italic_o italic_o italic_f end_POSTSUBSCRIPT</annotation></semantics></math></td> <td class="ltx_td ltx_align_left ltx_eqn_cell"><math alttext="\displaystyle=-\frac{1}{M}\sum_{m}^{M}[\mathcal{I}(y^{m}_{spoof}=1)\log P(% \boldsymbol{e}^{m}_{spoof})" class="ltx_math_unparsed" display="inline" id="S3.Ex1.m2.1"><semantics id="S3.Ex1.m2.1a"><mrow id="S3.Ex1.m2.1b"><mo id="S3.Ex1.m2.1.1" mathsize="90%" rspace="0em">=</mo><mo id="S3.Ex1.m2.1.2" lspace="0em" mathsize="90%">−</mo><mstyle displaystyle="true" id="S3.Ex1.m2.1.3"><mfrac id="S3.Ex1.m2.1.3a"><mn id="S3.Ex1.m2.1.3.2" mathsize="90%">1</mn><mi id="S3.Ex1.m2.1.3.3" mathsize="90%">M</mi></mfrac></mstyle><mstyle displaystyle="true" id="S3.Ex1.m2.1.4"><munderover id="S3.Ex1.m2.1.4a"><mo id="S3.Ex1.m2.1.4.2.2" maxsize="90%" minsize="90%" movablelimits="false" stretchy="true">∑</mo><mi id="S3.Ex1.m2.1.4.2.3" mathsize="90%">m</mi><mi id="S3.Ex1.m2.1.4.3" mathsize="90%">M</mi></munderover></mstyle><mrow id="S3.Ex1.m2.1.5"><mo id="S3.Ex1.m2.1.5.1" maxsize="90%" minsize="90%">[</mo><mi class="ltx_font_mathcaligraphic" id="S3.Ex1.m2.1.5.2" mathsize="90%">ℐ</mi><mrow id="S3.Ex1.m2.1.5.3"><mo id="S3.Ex1.m2.1.5.3.1" maxsize="90%" minsize="90%">(</mo><msubsup id="S3.Ex1.m2.1.5.3.2"><mi id="S3.Ex1.m2.1.5.3.2.2.2" mathsize="90%">y</mi><mrow id="S3.Ex1.m2.1.5.3.2.3"><mi id="S3.Ex1.m2.1.5.3.2.3.2" mathsize="90%">s</mi><mo id="S3.Ex1.m2.1.5.3.2.3.1">⁢</mo><mi id="S3.Ex1.m2.1.5.3.2.3.3" mathsize="90%">p</mi><mo id="S3.Ex1.m2.1.5.3.2.3.1a">⁢</mo><mi id="S3.Ex1.m2.1.5.3.2.3.4" mathsize="90%">o</mi><mo id="S3.Ex1.m2.1.5.3.2.3.1b">⁢</mo><mi id="S3.Ex1.m2.1.5.3.2.3.5" mathsize="90%">o</mi><mo id="S3.Ex1.m2.1.5.3.2.3.1c">⁢</mo><mi id="S3.Ex1.m2.1.5.3.2.3.6" mathsize="90%">f</mi></mrow><mi id="S3.Ex1.m2.1.5.3.2.2.3" mathsize="90%">m</mi></msubsup><mo id="S3.Ex1.m2.1.5.3.3" mathsize="90%">=</mo><mn id="S3.Ex1.m2.1.5.3.4" mathsize="90%">1</mn><mo id="S3.Ex1.m2.1.5.3.5" maxsize="90%" minsize="90%" rspace="0.167em">)</mo></mrow><mi id="S3.Ex1.m2.1.5.4" mathsize="90%">log</mi><mi id="S3.Ex1.m2.1.5.5" mathsize="90%">P</mi><mrow id="S3.Ex1.m2.1.5.6"><mo id="S3.Ex1.m2.1.5.6.1" maxsize="90%" minsize="90%">(</mo><msubsup id="S3.Ex1.m2.1.5.6.2"><mi id="S3.Ex1.m2.1.5.6.2.2.2" mathsize="90%">𝒆</mi><mrow id="S3.Ex1.m2.1.5.6.2.3"><mi id="S3.Ex1.m2.1.5.6.2.3.2" mathsize="90%">s</mi><mo id="S3.Ex1.m2.1.5.6.2.3.1">⁢</mo><mi id="S3.Ex1.m2.1.5.6.2.3.3" mathsize="90%">p</mi><mo id="S3.Ex1.m2.1.5.6.2.3.1a">⁢</mo><mi id="S3.Ex1.m2.1.5.6.2.3.4" mathsize="90%">o</mi><mo id="S3.Ex1.m2.1.5.6.2.3.1b">⁢</mo><mi id="S3.Ex1.m2.1.5.6.2.3.5" mathsize="90%">o</mi><mo id="S3.Ex1.m2.1.5.6.2.3.1c">⁢</mo><mi id="S3.Ex1.m2.1.5.6.2.3.6" mathsize="90%">f</mi></mrow><mi id="S3.Ex1.m2.1.5.6.2.2.3" mathsize="90%">m</mi></msubsup><mo id="S3.Ex1.m2.1.5.6.3" maxsize="90%" minsize="90%">)</mo></mrow></mrow></mrow><annotation encoding="application/x-tex" id="S3.Ex1.m2.1c">\displaystyle=-\frac{1}{M}\sum_{m}^{M}[\mathcal{I}(y^{m}_{spoof}=1)\log P(% \boldsymbol{e}^{m}_{spoof})</annotation><annotation encoding="application/x-llamapun" id="S3.Ex1.m2.1d">= - divide start_ARG 1 end_ARG start_ARG italic_M end_ARG ∑ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT [ caligraphic_I ( italic_y start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s italic_p italic_o italic_o italic_f end_POSTSUBSCRIPT = 1 ) roman_log italic_P ( bold_italic_e start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s italic_p italic_o italic_o italic_f end_POSTSUBSCRIPT )</annotation></semantics></math></td> <td class="ltx_eqn_cell ltx_eqn_center_padright"></td> </tr></tbody> <tbody id="S3.E8"><tr class="ltx_equation ltx_eqn_row ltx_align_baseline"> <td class="ltx_eqn_cell ltx_eqn_center_padleft"></td> <td class="ltx_td ltx_eqn_cell"></td> <td class="ltx_td ltx_align_left ltx_eqn_cell"><math alttext="\displaystyle+\mathcal{I}(y^{m}_{spoof}\neq 1)\log(1-P(\boldsymbol{e}^{m}_{% spoof}))]," class="ltx_math_unparsed" display="inline" id="S3.E8.m1.1"><semantics id="S3.E8.m1.1a"><mrow id="S3.E8.m1.1b"><mo id="S3.E8.m1.1.2" mathsize="90%">+</mo><mi class="ltx_font_mathcaligraphic" id="S3.E8.m1.1.3" mathsize="90%">ℐ</mi><mrow id="S3.E8.m1.1.4"><mo id="S3.E8.m1.1.4.1" maxsize="90%" minsize="90%">(</mo><msubsup id="S3.E8.m1.1.4.2"><mi id="S3.E8.m1.1.4.2.2.2" mathsize="90%">y</mi><mrow id="S3.E8.m1.1.4.2.3"><mi id="S3.E8.m1.1.4.2.3.2" mathsize="90%">s</mi><mo id="S3.E8.m1.1.4.2.3.1">⁢</mo><mi id="S3.E8.m1.1.4.2.3.3" mathsize="90%">p</mi><mo id="S3.E8.m1.1.4.2.3.1a">⁢</mo><mi id="S3.E8.m1.1.4.2.3.4" mathsize="90%">o</mi><mo id="S3.E8.m1.1.4.2.3.1b">⁢</mo><mi id="S3.E8.m1.1.4.2.3.5" mathsize="90%">o</mi><mo id="S3.E8.m1.1.4.2.3.1c">⁢</mo><mi id="S3.E8.m1.1.4.2.3.6" mathsize="90%">f</mi></mrow><mi id="S3.E8.m1.1.4.2.2.3" mathsize="90%">m</mi></msubsup><mo id="S3.E8.m1.1.4.3" mathsize="90%">≠</mo><mn id="S3.E8.m1.1.4.4" mathsize="90%">1</mn><mo id="S3.E8.m1.1.4.5" maxsize="90%" minsize="90%" rspace="0.167em">)</mo></mrow><mi id="S3.E8.m1.1.1" mathsize="90%">log</mi><mrow id="S3.E8.m1.1.5"><mo id="S3.E8.m1.1.5.1" maxsize="90%" minsize="90%">(</mo><mn id="S3.E8.m1.1.5.2" mathsize="90%">1</mn><mo id="S3.E8.m1.1.5.3" mathsize="90%">−</mo><mi id="S3.E8.m1.1.5.4" mathsize="90%">P</mi><mrow id="S3.E8.m1.1.5.5"><mo id="S3.E8.m1.1.5.5.1" maxsize="90%" minsize="90%">(</mo><msubsup id="S3.E8.m1.1.5.5.2"><mi id="S3.E8.m1.1.5.5.2.2.2" mathsize="90%">𝒆</mi><mrow id="S3.E8.m1.1.5.5.2.3"><mi id="S3.E8.m1.1.5.5.2.3.2" mathsize="90%">s</mi><mo id="S3.E8.m1.1.5.5.2.3.1">⁢</mo><mi id="S3.E8.m1.1.5.5.2.3.3" mathsize="90%">p</mi><mo id="S3.E8.m1.1.5.5.2.3.1a">⁢</mo><mi id="S3.E8.m1.1.5.5.2.3.4" mathsize="90%">o</mi><mo id="S3.E8.m1.1.5.5.2.3.1b">⁢</mo><mi id="S3.E8.m1.1.5.5.2.3.5" mathsize="90%">o</mi><mo id="S3.E8.m1.1.5.5.2.3.1c">⁢</mo><mi id="S3.E8.m1.1.5.5.2.3.6" mathsize="90%">f</mi></mrow><mi id="S3.E8.m1.1.5.5.2.2.3" mathsize="90%">m</mi></msubsup><mo id="S3.E8.m1.1.5.5.3" maxsize="90%" minsize="90%">)</mo></mrow><mo id="S3.E8.m1.1.5.6" maxsize="90%" minsize="90%">)</mo></mrow><mo id="S3.E8.m1.1.6" maxsize="90%" minsize="90%">]</mo><mo id="S3.E8.m1.1.7" mathsize="90%">,</mo></mrow><annotation encoding="application/x-tex" id="S3.E8.m1.1c">\displaystyle+\mathcal{I}(y^{m}_{spoof}\neq 1)\log(1-P(\boldsymbol{e}^{m}_{% spoof}))],</annotation><annotation encoding="application/x-llamapun" id="S3.E8.m1.1d">+ caligraphic_I ( italic_y start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s italic_p italic_o italic_o italic_f end_POSTSUBSCRIPT ≠ 1 ) roman_log ( 1 - italic_P ( bold_italic_e start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s italic_p italic_o italic_o italic_f end_POSTSUBSCRIPT ) ) ] ,</annotation></semantics></math></td> <td class="ltx_eqn_cell ltx_eqn_center_padright"></td> <td class="ltx_eqn_cell ltx_eqn_eqno ltx_align_middle ltx_align_right" rowspan="1"><span class="ltx_tag ltx_tag_equation ltx_align_right">(8)</span></td> </tr></tbody> <tbody id="S3.E9"><tr class="ltx_equation ltx_eqn_row ltx_align_baseline"> <td class="ltx_eqn_cell ltx_eqn_center_padleft"></td> <td class="ltx_td ltx_align_right ltx_eqn_cell"><math alttext="\displaystyle P(\boldsymbol{e}_{asv},s_{i})" class="ltx_Math" display="inline" id="S3.E9.m1.2"><semantics id="S3.E9.m1.2a"><mrow id="S3.E9.m1.2.2" xref="S3.E9.m1.2.2.cmml"><mi id="S3.E9.m1.2.2.4" mathsize="90%" xref="S3.E9.m1.2.2.4.cmml">P</mi><mo id="S3.E9.m1.2.2.3" xref="S3.E9.m1.2.2.3.cmml">⁢</mo><mrow id="S3.E9.m1.2.2.2.2" xref="S3.E9.m1.2.2.2.3.cmml"><mo id="S3.E9.m1.2.2.2.2.3" maxsize="90%" minsize="90%" xref="S3.E9.m1.2.2.2.3.cmml">(</mo><msub id="S3.E9.m1.1.1.1.1.1" xref="S3.E9.m1.1.1.1.1.1.cmml"><mi id="S3.E9.m1.1.1.1.1.1.2" mathsize="90%" xref="S3.E9.m1.1.1.1.1.1.2.cmml">𝒆</mi><mrow id="S3.E9.m1.1.1.1.1.1.3" xref="S3.E9.m1.1.1.1.1.1.3.cmml"><mi id="S3.E9.m1.1.1.1.1.1.3.2" mathsize="90%" xref="S3.E9.m1.1.1.1.1.1.3.2.cmml">a</mi><mo id="S3.E9.m1.1.1.1.1.1.3.1" xref="S3.E9.m1.1.1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.E9.m1.1.1.1.1.1.3.3" mathsize="90%" xref="S3.E9.m1.1.1.1.1.1.3.3.cmml">s</mi><mo id="S3.E9.m1.1.1.1.1.1.3.1a" xref="S3.E9.m1.1.1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.E9.m1.1.1.1.1.1.3.4" mathsize="90%" xref="S3.E9.m1.1.1.1.1.1.3.4.cmml">v</mi></mrow></msub><mo id="S3.E9.m1.2.2.2.2.4" mathsize="90%" xref="S3.E9.m1.2.2.2.3.cmml">,</mo><msub id="S3.E9.m1.2.2.2.2.2" xref="S3.E9.m1.2.2.2.2.2.cmml"><mi id="S3.E9.m1.2.2.2.2.2.2" mathsize="90%" xref="S3.E9.m1.2.2.2.2.2.2.cmml">s</mi><mi id="S3.E9.m1.2.2.2.2.2.3" mathsize="90%" xref="S3.E9.m1.2.2.2.2.2.3.cmml">i</mi></msub><mo id="S3.E9.m1.2.2.2.2.5" maxsize="90%" minsize="90%" xref="S3.E9.m1.2.2.2.3.cmml">)</mo></mrow></mrow><annotation-xml encoding="MathML-Content" id="S3.E9.m1.2b"><apply id="S3.E9.m1.2.2.cmml" xref="S3.E9.m1.2.2"><times id="S3.E9.m1.2.2.3.cmml" xref="S3.E9.m1.2.2.3"></times><ci id="S3.E9.m1.2.2.4.cmml" xref="S3.E9.m1.2.2.4">𝑃</ci><interval closure="open" id="S3.E9.m1.2.2.2.3.cmml" xref="S3.E9.m1.2.2.2.2"><apply id="S3.E9.m1.1.1.1.1.1.cmml" xref="S3.E9.m1.1.1.1.1.1"><csymbol cd="ambiguous" id="S3.E9.m1.1.1.1.1.1.1.cmml" xref="S3.E9.m1.1.1.1.1.1">subscript</csymbol><ci id="S3.E9.m1.1.1.1.1.1.2.cmml" xref="S3.E9.m1.1.1.1.1.1.2">𝒆</ci><apply id="S3.E9.m1.1.1.1.1.1.3.cmml" xref="S3.E9.m1.1.1.1.1.1.3"><times id="S3.E9.m1.1.1.1.1.1.3.1.cmml" xref="S3.E9.m1.1.1.1.1.1.3.1"></times><ci id="S3.E9.m1.1.1.1.1.1.3.2.cmml" xref="S3.E9.m1.1.1.1.1.1.3.2">𝑎</ci><ci id="S3.E9.m1.1.1.1.1.1.3.3.cmml" xref="S3.E9.m1.1.1.1.1.1.3.3">𝑠</ci><ci id="S3.E9.m1.1.1.1.1.1.3.4.cmml" xref="S3.E9.m1.1.1.1.1.1.3.4">𝑣</ci></apply></apply><apply id="S3.E9.m1.2.2.2.2.2.cmml" xref="S3.E9.m1.2.2.2.2.2"><csymbol cd="ambiguous" id="S3.E9.m1.2.2.2.2.2.1.cmml" xref="S3.E9.m1.2.2.2.2.2">subscript</csymbol><ci id="S3.E9.m1.2.2.2.2.2.2.cmml" xref="S3.E9.m1.2.2.2.2.2.2">𝑠</ci><ci id="S3.E9.m1.2.2.2.2.2.3.cmml" xref="S3.E9.m1.2.2.2.2.2.3">𝑖</ci></apply></interval></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E9.m1.2c">\displaystyle P(\boldsymbol{e}_{asv},s_{i})</annotation><annotation encoding="application/x-llamapun" id="S3.E9.m1.2d">italic_P ( bold_italic_e start_POSTSUBSCRIPT italic_a italic_s italic_v end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT )</annotation></semantics></math></td> <td class="ltx_td ltx_align_left ltx_eqn_cell"><math alttext="\displaystyle=\frac{\exp(h_{\boldsymbol{\theta}_{asv}^{s_{i}}}(\boldsymbol{e}_% {asv}))}{\sum_{j=1}^{S}\exp(h_{\boldsymbol{\theta}_{asv}^{s_{j}}}(\boldsymbol{% e}_{asv}))}," class="ltx_Math" display="inline" id="S3.E9.m2.5"><semantics id="S3.E9.m2.5a"><mrow id="S3.E9.m2.5.5.1" xref="S3.E9.m2.5.5.1.1.cmml"><mrow id="S3.E9.m2.5.5.1.1" xref="S3.E9.m2.5.5.1.1.cmml"><mi id="S3.E9.m2.5.5.1.1.2" xref="S3.E9.m2.5.5.1.1.2.cmml"></mi><mo id="S3.E9.m2.5.5.1.1.1" mathsize="90%" xref="S3.E9.m2.5.5.1.1.1.cmml">=</mo><mstyle displaystyle="true" id="S3.E9.m2.4.4" xref="S3.E9.m2.4.4.cmml"><mfrac id="S3.E9.m2.4.4a" xref="S3.E9.m2.4.4.cmml"><mrow id="S3.E9.m2.2.2.2.2" xref="S3.E9.m2.2.2.2.3.cmml"><mi id="S3.E9.m2.1.1.1.1" mathsize="90%" xref="S3.E9.m2.1.1.1.1.cmml">exp</mi><mo id="S3.E9.m2.2.2.2.2a" xref="S3.E9.m2.2.2.2.3.cmml">⁡</mo><mrow id="S3.E9.m2.2.2.2.2.1" xref="S3.E9.m2.2.2.2.3.cmml"><mo id="S3.E9.m2.2.2.2.2.1.2" maxsize="90%" minsize="90%" xref="S3.E9.m2.2.2.2.3.cmml">(</mo><mrow id="S3.E9.m2.2.2.2.2.1.1" xref="S3.E9.m2.2.2.2.2.1.1.cmml"><msub id="S3.E9.m2.2.2.2.2.1.1.3" xref="S3.E9.m2.2.2.2.2.1.1.3.cmml"><mi id="S3.E9.m2.2.2.2.2.1.1.3.2" mathsize="90%" xref="S3.E9.m2.2.2.2.2.1.1.3.2.cmml">h</mi><msubsup id="S3.E9.m2.2.2.2.2.1.1.3.3" xref="S3.E9.m2.2.2.2.2.1.1.3.3.cmml"><mi id="S3.E9.m2.2.2.2.2.1.1.3.3.2.2" mathsize="90%" xref="S3.E9.m2.2.2.2.2.1.1.3.3.2.2.cmml">𝜽</mi><mrow id="S3.E9.m2.2.2.2.2.1.1.3.3.2.3" xref="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.cmml"><mi id="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.2" mathsize="90%" xref="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.2.cmml">a</mi><mo id="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.1" xref="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.1.cmml">⁢</mo><mi id="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.3" mathsize="90%" xref="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.3.cmml">s</mi><mo id="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.1a" xref="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.1.cmml">⁢</mo><mi id="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.4" mathsize="90%" xref="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.4.cmml">v</mi></mrow><msub id="S3.E9.m2.2.2.2.2.1.1.3.3.3" xref="S3.E9.m2.2.2.2.2.1.1.3.3.3.cmml"><mi id="S3.E9.m2.2.2.2.2.1.1.3.3.3.2" mathsize="90%" xref="S3.E9.m2.2.2.2.2.1.1.3.3.3.2.cmml">s</mi><mi id="S3.E9.m2.2.2.2.2.1.1.3.3.3.3" mathsize="90%" xref="S3.E9.m2.2.2.2.2.1.1.3.3.3.3.cmml">i</mi></msub></msubsup></msub><mo id="S3.E9.m2.2.2.2.2.1.1.2" xref="S3.E9.m2.2.2.2.2.1.1.2.cmml">⁢</mo><mrow id="S3.E9.m2.2.2.2.2.1.1.1.1" xref="S3.E9.m2.2.2.2.2.1.1.1.1.1.cmml"><mo id="S3.E9.m2.2.2.2.2.1.1.1.1.2" maxsize="90%" minsize="90%" xref="S3.E9.m2.2.2.2.2.1.1.1.1.1.cmml">(</mo><msub id="S3.E9.m2.2.2.2.2.1.1.1.1.1" xref="S3.E9.m2.2.2.2.2.1.1.1.1.1.cmml"><mi id="S3.E9.m2.2.2.2.2.1.1.1.1.1.2" mathsize="90%" xref="S3.E9.m2.2.2.2.2.1.1.1.1.1.2.cmml">𝒆</mi><mrow id="S3.E9.m2.2.2.2.2.1.1.1.1.1.3" xref="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.cmml"><mi id="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.2" mathsize="90%" xref="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.2.cmml">a</mi><mo id="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.1" xref="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.3" mathsize="90%" xref="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.3.cmml">s</mi><mo id="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.1a" xref="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.4" mathsize="90%" xref="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.4.cmml">v</mi></mrow></msub><mo id="S3.E9.m2.2.2.2.2.1.1.1.1.3" maxsize="90%" minsize="90%" xref="S3.E9.m2.2.2.2.2.1.1.1.1.1.cmml">)</mo></mrow></mrow><mo id="S3.E9.m2.2.2.2.2.1.3" maxsize="90%" minsize="90%" xref="S3.E9.m2.2.2.2.3.cmml">)</mo></mrow></mrow><mrow id="S3.E9.m2.4.4.4" xref="S3.E9.m2.4.4.4.cmml"><msubsup id="S3.E9.m2.4.4.4.3" xref="S3.E9.m2.4.4.4.3.cmml"><mo id="S3.E9.m2.4.4.4.3.2.2" maxsize="90%" minsize="90%" stretchy="true" xref="S3.E9.m2.4.4.4.3.2.2.cmml">∑</mo><mrow id="S3.E9.m2.4.4.4.3.2.3" xref="S3.E9.m2.4.4.4.3.2.3.cmml"><mi id="S3.E9.m2.4.4.4.3.2.3.2" mathsize="90%" xref="S3.E9.m2.4.4.4.3.2.3.2.cmml">j</mi><mo id="S3.E9.m2.4.4.4.3.2.3.1" mathsize="90%" xref="S3.E9.m2.4.4.4.3.2.3.1.cmml">=</mo><mn id="S3.E9.m2.4.4.4.3.2.3.3" mathsize="90%" xref="S3.E9.m2.4.4.4.3.2.3.3.cmml">1</mn></mrow><mi id="S3.E9.m2.4.4.4.3.3" mathsize="90%" xref="S3.E9.m2.4.4.4.3.3.cmml">S</mi></msubsup><mrow id="S3.E9.m2.4.4.4.2.1" xref="S3.E9.m2.4.4.4.2.2.cmml"><mi id="S3.E9.m2.3.3.3.1" mathsize="90%" xref="S3.E9.m2.3.3.3.1.cmml">exp</mi><mo id="S3.E9.m2.4.4.4.2.1a" xref="S3.E9.m2.4.4.4.2.2.cmml">⁡</mo><mrow id="S3.E9.m2.4.4.4.2.1.1" xref="S3.E9.m2.4.4.4.2.2.cmml"><mo id="S3.E9.m2.4.4.4.2.1.1.2" maxsize="90%" minsize="90%" xref="S3.E9.m2.4.4.4.2.2.cmml">(</mo><mrow id="S3.E9.m2.4.4.4.2.1.1.1" xref="S3.E9.m2.4.4.4.2.1.1.1.cmml"><msub id="S3.E9.m2.4.4.4.2.1.1.1.3" xref="S3.E9.m2.4.4.4.2.1.1.1.3.cmml"><mi id="S3.E9.m2.4.4.4.2.1.1.1.3.2" mathsize="90%" xref="S3.E9.m2.4.4.4.2.1.1.1.3.2.cmml">h</mi><msubsup id="S3.E9.m2.4.4.4.2.1.1.1.3.3" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.cmml"><mi id="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.2" mathsize="90%" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.2.cmml">𝜽</mi><mrow id="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.cmml"><mi id="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.2" mathsize="90%" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.2.cmml">a</mi><mo id="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.1" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.1.cmml">⁢</mo><mi id="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.3" mathsize="90%" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.3.cmml">s</mi><mo id="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.1a" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.1.cmml">⁢</mo><mi id="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.4" mathsize="90%" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.4.cmml">v</mi></mrow><msub id="S3.E9.m2.4.4.4.2.1.1.1.3.3.3" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.3.cmml"><mi id="S3.E9.m2.4.4.4.2.1.1.1.3.3.3.2" mathsize="90%" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.3.2.cmml">s</mi><mi id="S3.E9.m2.4.4.4.2.1.1.1.3.3.3.3" mathsize="90%" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.3.3.cmml">j</mi></msub></msubsup></msub><mo id="S3.E9.m2.4.4.4.2.1.1.1.2" xref="S3.E9.m2.4.4.4.2.1.1.1.2.cmml">⁢</mo><mrow id="S3.E9.m2.4.4.4.2.1.1.1.1.1" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.cmml"><mo id="S3.E9.m2.4.4.4.2.1.1.1.1.1.2" maxsize="90%" minsize="90%" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.cmml">(</mo><msub id="S3.E9.m2.4.4.4.2.1.1.1.1.1.1" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.cmml"><mi id="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.2" mathsize="90%" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.2.cmml">𝒆</mi><mrow id="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.cmml"><mi id="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.2" mathsize="90%" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.2.cmml">a</mi><mo id="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.1" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.3" mathsize="90%" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.3.cmml">s</mi><mo id="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.1a" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.4" mathsize="90%" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.4.cmml">v</mi></mrow></msub><mo id="S3.E9.m2.4.4.4.2.1.1.1.1.1.3" maxsize="90%" minsize="90%" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.cmml">)</mo></mrow></mrow><mo id="S3.E9.m2.4.4.4.2.1.1.3" maxsize="90%" minsize="90%" xref="S3.E9.m2.4.4.4.2.2.cmml">)</mo></mrow></mrow></mrow></mfrac></mstyle></mrow><mo id="S3.E9.m2.5.5.1.2" mathsize="90%" xref="S3.E9.m2.5.5.1.1.cmml">,</mo></mrow><annotation-xml encoding="MathML-Content" id="S3.E9.m2.5b"><apply id="S3.E9.m2.5.5.1.1.cmml" xref="S3.E9.m2.5.5.1"><eq id="S3.E9.m2.5.5.1.1.1.cmml" xref="S3.E9.m2.5.5.1.1.1"></eq><csymbol cd="latexml" id="S3.E9.m2.5.5.1.1.2.cmml" xref="S3.E9.m2.5.5.1.1.2">absent</csymbol><apply id="S3.E9.m2.4.4.cmml" xref="S3.E9.m2.4.4"><divide id="S3.E9.m2.4.4.5.cmml" xref="S3.E9.m2.4.4"></divide><apply id="S3.E9.m2.2.2.2.3.cmml" xref="S3.E9.m2.2.2.2.2"><exp id="S3.E9.m2.1.1.1.1.cmml" xref="S3.E9.m2.1.1.1.1"></exp><apply id="S3.E9.m2.2.2.2.2.1.1.cmml" xref="S3.E9.m2.2.2.2.2.1.1"><times id="S3.E9.m2.2.2.2.2.1.1.2.cmml" xref="S3.E9.m2.2.2.2.2.1.1.2"></times><apply id="S3.E9.m2.2.2.2.2.1.1.3.cmml" xref="S3.E9.m2.2.2.2.2.1.1.3"><csymbol cd="ambiguous" id="S3.E9.m2.2.2.2.2.1.1.3.1.cmml" xref="S3.E9.m2.2.2.2.2.1.1.3">subscript</csymbol><ci id="S3.E9.m2.2.2.2.2.1.1.3.2.cmml" xref="S3.E9.m2.2.2.2.2.1.1.3.2">ℎ</ci><apply id="S3.E9.m2.2.2.2.2.1.1.3.3.cmml" xref="S3.E9.m2.2.2.2.2.1.1.3.3"><csymbol cd="ambiguous" id="S3.E9.m2.2.2.2.2.1.1.3.3.1.cmml" xref="S3.E9.m2.2.2.2.2.1.1.3.3">superscript</csymbol><apply id="S3.E9.m2.2.2.2.2.1.1.3.3.2.cmml" xref="S3.E9.m2.2.2.2.2.1.1.3.3"><csymbol cd="ambiguous" id="S3.E9.m2.2.2.2.2.1.1.3.3.2.1.cmml" xref="S3.E9.m2.2.2.2.2.1.1.3.3">subscript</csymbol><ci id="S3.E9.m2.2.2.2.2.1.1.3.3.2.2.cmml" xref="S3.E9.m2.2.2.2.2.1.1.3.3.2.2">𝜽</ci><apply id="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.cmml" xref="S3.E9.m2.2.2.2.2.1.1.3.3.2.3"><times id="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.1.cmml" xref="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.1"></times><ci id="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.2.cmml" xref="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.2">𝑎</ci><ci id="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.3.cmml" xref="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.3">𝑠</ci><ci id="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.4.cmml" xref="S3.E9.m2.2.2.2.2.1.1.3.3.2.3.4">𝑣</ci></apply></apply><apply id="S3.E9.m2.2.2.2.2.1.1.3.3.3.cmml" xref="S3.E9.m2.2.2.2.2.1.1.3.3.3"><csymbol cd="ambiguous" id="S3.E9.m2.2.2.2.2.1.1.3.3.3.1.cmml" xref="S3.E9.m2.2.2.2.2.1.1.3.3.3">subscript</csymbol><ci id="S3.E9.m2.2.2.2.2.1.1.3.3.3.2.cmml" xref="S3.E9.m2.2.2.2.2.1.1.3.3.3.2">𝑠</ci><ci id="S3.E9.m2.2.2.2.2.1.1.3.3.3.3.cmml" xref="S3.E9.m2.2.2.2.2.1.1.3.3.3.3">𝑖</ci></apply></apply></apply><apply id="S3.E9.m2.2.2.2.2.1.1.1.1.1.cmml" xref="S3.E9.m2.2.2.2.2.1.1.1.1"><csymbol cd="ambiguous" id="S3.E9.m2.2.2.2.2.1.1.1.1.1.1.cmml" xref="S3.E9.m2.2.2.2.2.1.1.1.1">subscript</csymbol><ci id="S3.E9.m2.2.2.2.2.1.1.1.1.1.2.cmml" xref="S3.E9.m2.2.2.2.2.1.1.1.1.1.2">𝒆</ci><apply id="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.cmml" xref="S3.E9.m2.2.2.2.2.1.1.1.1.1.3"><times id="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.1.cmml" xref="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.1"></times><ci id="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.2.cmml" xref="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.2">𝑎</ci><ci id="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.3.cmml" xref="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.3">𝑠</ci><ci id="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.4.cmml" xref="S3.E9.m2.2.2.2.2.1.1.1.1.1.3.4">𝑣</ci></apply></apply></apply></apply><apply id="S3.E9.m2.4.4.4.cmml" xref="S3.E9.m2.4.4.4"><apply id="S3.E9.m2.4.4.4.3.cmml" xref="S3.E9.m2.4.4.4.3"><csymbol cd="ambiguous" id="S3.E9.m2.4.4.4.3.1.cmml" xref="S3.E9.m2.4.4.4.3">superscript</csymbol><apply id="S3.E9.m2.4.4.4.3.2.cmml" xref="S3.E9.m2.4.4.4.3"><csymbol cd="ambiguous" id="S3.E9.m2.4.4.4.3.2.1.cmml" xref="S3.E9.m2.4.4.4.3">subscript</csymbol><sum id="S3.E9.m2.4.4.4.3.2.2.cmml" xref="S3.E9.m2.4.4.4.3.2.2"></sum><apply id="S3.E9.m2.4.4.4.3.2.3.cmml" xref="S3.E9.m2.4.4.4.3.2.3"><eq id="S3.E9.m2.4.4.4.3.2.3.1.cmml" xref="S3.E9.m2.4.4.4.3.2.3.1"></eq><ci id="S3.E9.m2.4.4.4.3.2.3.2.cmml" xref="S3.E9.m2.4.4.4.3.2.3.2">𝑗</ci><cn id="S3.E9.m2.4.4.4.3.2.3.3.cmml" type="integer" xref="S3.E9.m2.4.4.4.3.2.3.3">1</cn></apply></apply><ci id="S3.E9.m2.4.4.4.3.3.cmml" xref="S3.E9.m2.4.4.4.3.3">𝑆</ci></apply><apply id="S3.E9.m2.4.4.4.2.2.cmml" xref="S3.E9.m2.4.4.4.2.1"><exp id="S3.E9.m2.3.3.3.1.cmml" xref="S3.E9.m2.3.3.3.1"></exp><apply id="S3.E9.m2.4.4.4.2.1.1.1.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1"><times id="S3.E9.m2.4.4.4.2.1.1.1.2.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.2"></times><apply id="S3.E9.m2.4.4.4.2.1.1.1.3.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.3"><csymbol cd="ambiguous" id="S3.E9.m2.4.4.4.2.1.1.1.3.1.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.3">subscript</csymbol><ci id="S3.E9.m2.4.4.4.2.1.1.1.3.2.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.3.2">ℎ</ci><apply id="S3.E9.m2.4.4.4.2.1.1.1.3.3.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3"><csymbol cd="ambiguous" id="S3.E9.m2.4.4.4.2.1.1.1.3.3.1.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3">superscript</csymbol><apply id="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3"><csymbol cd="ambiguous" id="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.1.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3">subscript</csymbol><ci id="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.2.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.2">𝜽</ci><apply id="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3"><times id="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.1.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.1"></times><ci id="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.2.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.2">𝑎</ci><ci id="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.3.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.3">𝑠</ci><ci id="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.4.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.2.3.4">𝑣</ci></apply></apply><apply id="S3.E9.m2.4.4.4.2.1.1.1.3.3.3.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.3"><csymbol cd="ambiguous" id="S3.E9.m2.4.4.4.2.1.1.1.3.3.3.1.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.3">subscript</csymbol><ci id="S3.E9.m2.4.4.4.2.1.1.1.3.3.3.2.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.3.2">𝑠</ci><ci id="S3.E9.m2.4.4.4.2.1.1.1.3.3.3.3.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.3.3.3.3">𝑗</ci></apply></apply></apply><apply id="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1"><csymbol cd="ambiguous" id="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.1.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1">subscript</csymbol><ci id="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.2.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.2">𝒆</ci><apply id="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3"><times id="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.1.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.1"></times><ci id="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.2.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.2">𝑎</ci><ci id="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.3.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.3">𝑠</ci><ci id="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.4.cmml" xref="S3.E9.m2.4.4.4.2.1.1.1.1.1.1.3.4">𝑣</ci></apply></apply></apply></apply></apply></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E9.m2.5c">\displaystyle=\frac{\exp(h_{\boldsymbol{\theta}_{asv}^{s_{i}}}(\boldsymbol{e}_% {asv}))}{\sum_{j=1}^{S}\exp(h_{\boldsymbol{\theta}_{asv}^{s_{j}}}(\boldsymbol{% e}_{asv}))},</annotation><annotation encoding="application/x-llamapun" id="S3.E9.m2.5d">= divide start_ARG roman_exp ( italic_h start_POSTSUBSCRIPT bold_italic_θ start_POSTSUBSCRIPT italic_a italic_s italic_v end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( bold_italic_e start_POSTSUBSCRIPT italic_a italic_s italic_v end_POSTSUBSCRIPT ) ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT roman_exp ( italic_h start_POSTSUBSCRIPT bold_italic_θ start_POSTSUBSCRIPT italic_a italic_s italic_v end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( bold_italic_e start_POSTSUBSCRIPT italic_a italic_s italic_v end_POSTSUBSCRIPT ) ) end_ARG ,</annotation></semantics></math></td> <td class="ltx_eqn_cell ltx_eqn_center_padright"></td> <td class="ltx_eqn_cell ltx_eqn_eqno ltx_align_middle ltx_align_right" rowspan="1"><span class="ltx_tag ltx_tag_equation ltx_align_right">(9)</span></td> </tr></tbody> <tbody id="S3.E10"><tr class="ltx_equation ltx_eqn_row ltx_align_baseline"> <td class="ltx_eqn_cell ltx_eqn_center_padleft"></td> <td class="ltx_td ltx_align_right ltx_eqn_cell"><math alttext="\displaystyle\mathcal{L}_{asv}" class="ltx_Math" display="inline" id="S3.E10.m1.1"><semantics id="S3.E10.m1.1a"><msub id="S3.E10.m1.1.1" xref="S3.E10.m1.1.1.cmml"><mi class="ltx_font_mathcaligraphic" id="S3.E10.m1.1.1.2" mathsize="90%" xref="S3.E10.m1.1.1.2.cmml">ℒ</mi><mrow id="S3.E10.m1.1.1.3" xref="S3.E10.m1.1.1.3.cmml"><mi id="S3.E10.m1.1.1.3.2" mathsize="90%" xref="S3.E10.m1.1.1.3.2.cmml">a</mi><mo id="S3.E10.m1.1.1.3.1" xref="S3.E10.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.E10.m1.1.1.3.3" mathsize="90%" xref="S3.E10.m1.1.1.3.3.cmml">s</mi><mo id="S3.E10.m1.1.1.3.1a" xref="S3.E10.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.E10.m1.1.1.3.4" mathsize="90%" xref="S3.E10.m1.1.1.3.4.cmml">v</mi></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.E10.m1.1b"><apply id="S3.E10.m1.1.1.cmml" xref="S3.E10.m1.1.1"><csymbol cd="ambiguous" id="S3.E10.m1.1.1.1.cmml" xref="S3.E10.m1.1.1">subscript</csymbol><ci id="S3.E10.m1.1.1.2.cmml" xref="S3.E10.m1.1.1.2">ℒ</ci><apply id="S3.E10.m1.1.1.3.cmml" xref="S3.E10.m1.1.1.3"><times id="S3.E10.m1.1.1.3.1.cmml" xref="S3.E10.m1.1.1.3.1"></times><ci id="S3.E10.m1.1.1.3.2.cmml" xref="S3.E10.m1.1.1.3.2">𝑎</ci><ci id="S3.E10.m1.1.1.3.3.cmml" xref="S3.E10.m1.1.1.3.3">𝑠</ci><ci id="S3.E10.m1.1.1.3.4.cmml" xref="S3.E10.m1.1.1.3.4">𝑣</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E10.m1.1c">\displaystyle\mathcal{L}_{asv}</annotation><annotation encoding="application/x-llamapun" id="S3.E10.m1.1d">caligraphic_L start_POSTSUBSCRIPT italic_a italic_s italic_v end_POSTSUBSCRIPT</annotation></semantics></math></td> <td class="ltx_td ltx_align_left ltx_eqn_cell"><math alttext="\displaystyle=-\frac{1}{M}\sum_{m}^{M}\log P(\boldsymbol{e}_{asv}^{m},s_{i}^{m% })," class="ltx_Math" display="inline" id="S3.E10.m2.1"><semantics id="S3.E10.m2.1a"><mrow id="S3.E10.m2.1.1.1" xref="S3.E10.m2.1.1.1.1.cmml"><mrow id="S3.E10.m2.1.1.1.1" xref="S3.E10.m2.1.1.1.1.cmml"><mi id="S3.E10.m2.1.1.1.1.4" xref="S3.E10.m2.1.1.1.1.4.cmml"></mi><mo id="S3.E10.m2.1.1.1.1.3" mathsize="90%" xref="S3.E10.m2.1.1.1.1.3.cmml">=</mo><mrow id="S3.E10.m2.1.1.1.1.2" xref="S3.E10.m2.1.1.1.1.2.cmml"><mo id="S3.E10.m2.1.1.1.1.2a" mathsize="90%" xref="S3.E10.m2.1.1.1.1.2.cmml">−</mo><mrow id="S3.E10.m2.1.1.1.1.2.2" xref="S3.E10.m2.1.1.1.1.2.2.cmml"><mstyle displaystyle="true" id="S3.E10.m2.1.1.1.1.2.2.4" xref="S3.E10.m2.1.1.1.1.2.2.4.cmml"><mfrac id="S3.E10.m2.1.1.1.1.2.2.4a" xref="S3.E10.m2.1.1.1.1.2.2.4.cmml"><mn id="S3.E10.m2.1.1.1.1.2.2.4.2" mathsize="90%" xref="S3.E10.m2.1.1.1.1.2.2.4.2.cmml">1</mn><mi id="S3.E10.m2.1.1.1.1.2.2.4.3" mathsize="90%" xref="S3.E10.m2.1.1.1.1.2.2.4.3.cmml">M</mi></mfrac></mstyle><mo id="S3.E10.m2.1.1.1.1.2.2.3" xref="S3.E10.m2.1.1.1.1.2.2.3.cmml">⁢</mo><mrow id="S3.E10.m2.1.1.1.1.2.2.2" xref="S3.E10.m2.1.1.1.1.2.2.2.cmml"><mstyle displaystyle="true" id="S3.E10.m2.1.1.1.1.2.2.2.3" xref="S3.E10.m2.1.1.1.1.2.2.2.3.cmml"><munderover id="S3.E10.m2.1.1.1.1.2.2.2.3a" xref="S3.E10.m2.1.1.1.1.2.2.2.3.cmml"><mo id="S3.E10.m2.1.1.1.1.2.2.2.3.2.2" maxsize="90%" minsize="90%" movablelimits="false" stretchy="true" xref="S3.E10.m2.1.1.1.1.2.2.2.3.2.2.cmml">∑</mo><mi id="S3.E10.m2.1.1.1.1.2.2.2.3.2.3" mathsize="90%" xref="S3.E10.m2.1.1.1.1.2.2.2.3.2.3.cmml">m</mi><mi id="S3.E10.m2.1.1.1.1.2.2.2.3.3" mathsize="90%" xref="S3.E10.m2.1.1.1.1.2.2.2.3.3.cmml">M</mi></munderover></mstyle><mrow id="S3.E10.m2.1.1.1.1.2.2.2.2" xref="S3.E10.m2.1.1.1.1.2.2.2.2.cmml"><mrow id="S3.E10.m2.1.1.1.1.2.2.2.2.4" xref="S3.E10.m2.1.1.1.1.2.2.2.2.4.cmml"><mi id="S3.E10.m2.1.1.1.1.2.2.2.2.4.1" mathsize="90%" xref="S3.E10.m2.1.1.1.1.2.2.2.2.4.1.cmml">log</mi><mo id="S3.E10.m2.1.1.1.1.2.2.2.2.4a" lspace="0.167em" xref="S3.E10.m2.1.1.1.1.2.2.2.2.4.cmml">⁡</mo><mi id="S3.E10.m2.1.1.1.1.2.2.2.2.4.2" mathsize="90%" xref="S3.E10.m2.1.1.1.1.2.2.2.2.4.2.cmml">P</mi></mrow><mo id="S3.E10.m2.1.1.1.1.2.2.2.2.3" xref="S3.E10.m2.1.1.1.1.2.2.2.2.3.cmml">⁢</mo><mrow id="S3.E10.m2.1.1.1.1.2.2.2.2.2.2" xref="S3.E10.m2.1.1.1.1.2.2.2.2.2.3.cmml"><mo id="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.3" maxsize="90%" minsize="90%" xref="S3.E10.m2.1.1.1.1.2.2.2.2.2.3.cmml">(</mo><msubsup id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.cmml"><mi id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.2" mathsize="90%" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.2.cmml">𝒆</mi><mrow id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.cmml"><mi id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.2" mathsize="90%" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.2.cmml">a</mi><mo id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.1" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.3" mathsize="90%" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.3.cmml">s</mi><mo id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.1a" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.1.cmml">⁢</mo><mi id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.4" mathsize="90%" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.4.cmml">v</mi></mrow><mi id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.3" mathsize="90%" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.3.cmml">m</mi></msubsup><mo id="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.4" mathsize="90%" xref="S3.E10.m2.1.1.1.1.2.2.2.2.2.3.cmml">,</mo><msubsup id="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2" xref="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2.cmml"><mi id="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2.2.2" mathsize="90%" xref="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2.2.2.cmml">s</mi><mi id="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2.2.3" mathsize="90%" xref="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2.2.3.cmml">i</mi><mi id="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2.3" mathsize="90%" xref="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2.3.cmml">m</mi></msubsup><mo id="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.5" maxsize="90%" minsize="90%" xref="S3.E10.m2.1.1.1.1.2.2.2.2.2.3.cmml">)</mo></mrow></mrow></mrow></mrow></mrow></mrow><mo id="S3.E10.m2.1.1.1.2" mathsize="90%" xref="S3.E10.m2.1.1.1.1.cmml">,</mo></mrow><annotation-xml encoding="MathML-Content" id="S3.E10.m2.1b"><apply id="S3.E10.m2.1.1.1.1.cmml" xref="S3.E10.m2.1.1.1"><eq id="S3.E10.m2.1.1.1.1.3.cmml" xref="S3.E10.m2.1.1.1.1.3"></eq><csymbol cd="latexml" id="S3.E10.m2.1.1.1.1.4.cmml" xref="S3.E10.m2.1.1.1.1.4">absent</csymbol><apply id="S3.E10.m2.1.1.1.1.2.cmml" xref="S3.E10.m2.1.1.1.1.2"><minus id="S3.E10.m2.1.1.1.1.2.3.cmml" xref="S3.E10.m2.1.1.1.1.2"></minus><apply id="S3.E10.m2.1.1.1.1.2.2.cmml" xref="S3.E10.m2.1.1.1.1.2.2"><times id="S3.E10.m2.1.1.1.1.2.2.3.cmml" xref="S3.E10.m2.1.1.1.1.2.2.3"></times><apply id="S3.E10.m2.1.1.1.1.2.2.4.cmml" xref="S3.E10.m2.1.1.1.1.2.2.4"><divide id="S3.E10.m2.1.1.1.1.2.2.4.1.cmml" xref="S3.E10.m2.1.1.1.1.2.2.4"></divide><cn id="S3.E10.m2.1.1.1.1.2.2.4.2.cmml" type="integer" xref="S3.E10.m2.1.1.1.1.2.2.4.2">1</cn><ci id="S3.E10.m2.1.1.1.1.2.2.4.3.cmml" xref="S3.E10.m2.1.1.1.1.2.2.4.3">𝑀</ci></apply><apply id="S3.E10.m2.1.1.1.1.2.2.2.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2"><apply id="S3.E10.m2.1.1.1.1.2.2.2.3.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.3"><csymbol cd="ambiguous" id="S3.E10.m2.1.1.1.1.2.2.2.3.1.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.3">superscript</csymbol><apply id="S3.E10.m2.1.1.1.1.2.2.2.3.2.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.3"><csymbol cd="ambiguous" id="S3.E10.m2.1.1.1.1.2.2.2.3.2.1.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.3">subscript</csymbol><sum id="S3.E10.m2.1.1.1.1.2.2.2.3.2.2.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.3.2.2"></sum><ci id="S3.E10.m2.1.1.1.1.2.2.2.3.2.3.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.3.2.3">𝑚</ci></apply><ci id="S3.E10.m2.1.1.1.1.2.2.2.3.3.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.3.3">𝑀</ci></apply><apply id="S3.E10.m2.1.1.1.1.2.2.2.2.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.2"><times id="S3.E10.m2.1.1.1.1.2.2.2.2.3.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.2.3"></times><apply id="S3.E10.m2.1.1.1.1.2.2.2.2.4.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.2.4"><log id="S3.E10.m2.1.1.1.1.2.2.2.2.4.1.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.2.4.1"></log><ci id="S3.E10.m2.1.1.1.1.2.2.2.2.4.2.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.2.4.2">𝑃</ci></apply><interval closure="open" id="S3.E10.m2.1.1.1.1.2.2.2.2.2.3.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.2.2.2"><apply id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.cmml" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1"><csymbol cd="ambiguous" id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.1.cmml" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1">superscript</csymbol><apply id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.cmml" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1"><csymbol cd="ambiguous" id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.1.cmml" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1">subscript</csymbol><ci id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.2.cmml" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.2">𝒆</ci><apply id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.cmml" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3"><times id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.1.cmml" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.1"></times><ci id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.2.cmml" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.2">𝑎</ci><ci id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.3.cmml" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.3">𝑠</ci><ci id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.4.cmml" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.2.3.4">𝑣</ci></apply></apply><ci id="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.3.cmml" xref="S3.E10.m2.1.1.1.1.1.1.1.1.1.1.1.3">𝑚</ci></apply><apply id="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2"><csymbol cd="ambiguous" id="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2.1.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2">superscript</csymbol><apply id="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2.2.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2"><csymbol cd="ambiguous" id="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2.2.1.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2">subscript</csymbol><ci id="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2.2.2.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2.2.2">𝑠</ci><ci id="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2.2.3.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2.2.3">𝑖</ci></apply><ci id="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2.3.cmml" xref="S3.E10.m2.1.1.1.1.2.2.2.2.2.2.2.3">𝑚</ci></apply></interval></apply></apply></apply></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E10.m2.1c">\displaystyle=-\frac{1}{M}\sum_{m}^{M}\log P(\boldsymbol{e}_{asv}^{m},s_{i}^{m% }),</annotation><annotation encoding="application/x-llamapun" id="S3.E10.m2.1d">= - divide start_ARG 1 end_ARG start_ARG italic_M end_ARG ∑ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT roman_log italic_P ( bold_italic_e start_POSTSUBSCRIPT italic_a italic_s italic_v end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) ,</annotation></semantics></math></td> <td class="ltx_eqn_cell ltx_eqn_center_padright"></td> <td class="ltx_eqn_cell ltx_eqn_eqno ltx_align_middle ltx_align_right" rowspan="1"><span class="ltx_tag ltx_tag_equation ltx_align_right">(10)</span></td> </tr></tbody> </table> <p class="ltx_p" id="S3.SS2.SSS2.p2.12"><span class="ltx_text" id="S3.SS2.SSS2.p2.12.1" style="font-size:90%;">where </span><math alttext="P(\boldsymbol{e}_{spoof})" class="ltx_Math" display="inline" id="S3.SS2.SSS2.p2.3.m1.1"><semantics id="S3.SS2.SSS2.p2.3.m1.1a"><mrow id="S3.SS2.SSS2.p2.3.m1.1.1" xref="S3.SS2.SSS2.p2.3.m1.1.1.cmml"><mi id="S3.SS2.SSS2.p2.3.m1.1.1.3" mathsize="90%" xref="S3.SS2.SSS2.p2.3.m1.1.1.3.cmml">P</mi><mo id="S3.SS2.SSS2.p2.3.m1.1.1.2" xref="S3.SS2.SSS2.p2.3.m1.1.1.2.cmml">⁢</mo><mrow id="S3.SS2.SSS2.p2.3.m1.1.1.1.1" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.cmml"><mo id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.2" maxsize="90%" minsize="90%" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.cmml">(</mo><msub id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.cmml"><mi id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.2" mathsize="90%" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.2.cmml">𝒆</mi><mrow id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.cmml"><mi id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.2" mathsize="90%" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.2.cmml">s</mi><mo id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.1" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.3" mathsize="90%" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.3.cmml">p</mi><mo id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.1a" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.4" mathsize="90%" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.4.cmml">o</mi><mo id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.1b" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.5" mathsize="90%" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.5.cmml">o</mi><mo id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.1c" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.6" mathsize="90%" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.6.cmml">f</mi></mrow></msub><mo id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.3" maxsize="90%" minsize="90%" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.cmml">)</mo></mrow></mrow><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS2.p2.3.m1.1b"><apply id="S3.SS2.SSS2.p2.3.m1.1.1.cmml" xref="S3.SS2.SSS2.p2.3.m1.1.1"><times id="S3.SS2.SSS2.p2.3.m1.1.1.2.cmml" xref="S3.SS2.SSS2.p2.3.m1.1.1.2"></times><ci id="S3.SS2.SSS2.p2.3.m1.1.1.3.cmml" xref="S3.SS2.SSS2.p2.3.m1.1.1.3">𝑃</ci><apply id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.cmml" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.1.cmml" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1">subscript</csymbol><ci id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.2.cmml" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.2">𝒆</ci><apply id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.cmml" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3"><times id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.1.cmml" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.1"></times><ci id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.2.cmml" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.2">𝑠</ci><ci id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.3.cmml" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.3">𝑝</ci><ci id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.4.cmml" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.4">𝑜</ci><ci id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.5.cmml" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.5">𝑜</ci><ci id="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.6.cmml" xref="S3.SS2.SSS2.p2.3.m1.1.1.1.1.1.3.6">𝑓</ci></apply></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS2.p2.3.m1.1c">P(\boldsymbol{e}_{spoof})</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS2.p2.3.m1.1d">italic_P ( bold_italic_e start_POSTSUBSCRIPT italic_s italic_p italic_o italic_o italic_f end_POSTSUBSCRIPT )</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS2.p2.12.2" style="font-size:90%;"> and </span><math alttext="P(\boldsymbol{e}_{asv})" class="ltx_Math" display="inline" id="S3.SS2.SSS2.p2.4.m2.1"><semantics id="S3.SS2.SSS2.p2.4.m2.1a"><mrow id="S3.SS2.SSS2.p2.4.m2.1.1" xref="S3.SS2.SSS2.p2.4.m2.1.1.cmml"><mi id="S3.SS2.SSS2.p2.4.m2.1.1.3" mathsize="90%" xref="S3.SS2.SSS2.p2.4.m2.1.1.3.cmml">P</mi><mo id="S3.SS2.SSS2.p2.4.m2.1.1.2" xref="S3.SS2.SSS2.p2.4.m2.1.1.2.cmml">⁢</mo><mrow id="S3.SS2.SSS2.p2.4.m2.1.1.1.1" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.cmml"><mo id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.2" maxsize="90%" minsize="90%" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.cmml">(</mo><msub id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.cmml"><mi id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.2" mathsize="90%" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.2.cmml">𝒆</mi><mrow id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.cmml"><mi id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.2" mathsize="90%" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.2.cmml">a</mi><mo id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.1" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.3" mathsize="90%" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.3.cmml">s</mi><mo id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.1a" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.4" mathsize="90%" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.4.cmml">v</mi></mrow></msub><mo id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.3" maxsize="90%" minsize="90%" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.cmml">)</mo></mrow></mrow><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS2.p2.4.m2.1b"><apply id="S3.SS2.SSS2.p2.4.m2.1.1.cmml" xref="S3.SS2.SSS2.p2.4.m2.1.1"><times id="S3.SS2.SSS2.p2.4.m2.1.1.2.cmml" xref="S3.SS2.SSS2.p2.4.m2.1.1.2"></times><ci id="S3.SS2.SSS2.p2.4.m2.1.1.3.cmml" xref="S3.SS2.SSS2.p2.4.m2.1.1.3">𝑃</ci><apply id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.cmml" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.1.cmml" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1">subscript</csymbol><ci id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.2.cmml" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.2">𝒆</ci><apply id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.cmml" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3"><times id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.1.cmml" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.1"></times><ci id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.2.cmml" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.2">𝑎</ci><ci id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.3.cmml" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.3">𝑠</ci><ci id="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.4.cmml" xref="S3.SS2.SSS2.p2.4.m2.1.1.1.1.1.3.4">𝑣</ci></apply></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS2.p2.4.m2.1c">P(\boldsymbol{e}_{asv})</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS2.p2.4.m2.1d">italic_P ( bold_italic_e start_POSTSUBSCRIPT italic_a italic_s italic_v end_POSTSUBSCRIPT )</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS2.p2.12.3" style="font-size:90%;"> represent the probabilities for the anti-spoofing and ASV tasks, respectively. </span><math alttext="\mathcal{I}(\cdot)" class="ltx_Math" display="inline" id="S3.SS2.SSS2.p2.5.m3.1"><semantics id="S3.SS2.SSS2.p2.5.m3.1a"><mrow id="S3.SS2.SSS2.p2.5.m3.1.2" xref="S3.SS2.SSS2.p2.5.m3.1.2.cmml"><mi class="ltx_font_mathcaligraphic" id="S3.SS2.SSS2.p2.5.m3.1.2.2" mathsize="90%" xref="S3.SS2.SSS2.p2.5.m3.1.2.2.cmml">ℐ</mi><mo id="S3.SS2.SSS2.p2.5.m3.1.2.1" xref="S3.SS2.SSS2.p2.5.m3.1.2.1.cmml">⁢</mo><mrow id="S3.SS2.SSS2.p2.5.m3.1.2.3.2" xref="S3.SS2.SSS2.p2.5.m3.1.2.cmml"><mo id="S3.SS2.SSS2.p2.5.m3.1.2.3.2.1" maxsize="90%" minsize="90%" xref="S3.SS2.SSS2.p2.5.m3.1.2.cmml">(</mo><mo id="S3.SS2.SSS2.p2.5.m3.1.1" lspace="0em" mathsize="90%" rspace="0em" xref="S3.SS2.SSS2.p2.5.m3.1.1.cmml">⋅</mo><mo id="S3.SS2.SSS2.p2.5.m3.1.2.3.2.2" maxsize="90%" minsize="90%" xref="S3.SS2.SSS2.p2.5.m3.1.2.cmml">)</mo></mrow></mrow><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS2.p2.5.m3.1b"><apply id="S3.SS2.SSS2.p2.5.m3.1.2.cmml" xref="S3.SS2.SSS2.p2.5.m3.1.2"><times id="S3.SS2.SSS2.p2.5.m3.1.2.1.cmml" xref="S3.SS2.SSS2.p2.5.m3.1.2.1"></times><ci id="S3.SS2.SSS2.p2.5.m3.1.2.2.cmml" xref="S3.SS2.SSS2.p2.5.m3.1.2.2">ℐ</ci><ci id="S3.SS2.SSS2.p2.5.m3.1.1.cmml" xref="S3.SS2.SSS2.p2.5.m3.1.1">⋅</ci></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS2.p2.5.m3.1c">\mathcal{I}(\cdot)</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS2.p2.5.m3.1d">caligraphic_I ( ⋅ )</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS2.p2.12.4" style="font-size:90%;"> is an indicator function that returns 1 if the condition is true and 0 otherwise. The variable </span><math alttext="S" class="ltx_Math" display="inline" id="S3.SS2.SSS2.p2.6.m4.1"><semantics id="S3.SS2.SSS2.p2.6.m4.1a"><mi id="S3.SS2.SSS2.p2.6.m4.1.1" mathsize="90%" xref="S3.SS2.SSS2.p2.6.m4.1.1.cmml">S</mi><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS2.p2.6.m4.1b"><ci id="S3.SS2.SSS2.p2.6.m4.1.1.cmml" xref="S3.SS2.SSS2.p2.6.m4.1.1">𝑆</ci></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS2.p2.6.m4.1c">S</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS2.p2.6.m4.1d">italic_S</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS2.p2.12.5" style="font-size:90%;"> denotes the number of speakers in the training dataset, while </span><math alttext="M" class="ltx_Math" display="inline" id="S3.SS2.SSS2.p2.7.m5.1"><semantics id="S3.SS2.SSS2.p2.7.m5.1a"><mi id="S3.SS2.SSS2.p2.7.m5.1.1" mathsize="90%" xref="S3.SS2.SSS2.p2.7.m5.1.1.cmml">M</mi><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS2.p2.7.m5.1b"><ci id="S3.SS2.SSS2.p2.7.m5.1.1.cmml" xref="S3.SS2.SSS2.p2.7.m5.1.1">𝑀</ci></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS2.p2.7.m5.1c">M</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS2.p2.7.m5.1d">italic_M</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS2.p2.12.6" style="font-size:90%;"> represents the number of training samples. </span><math alttext="s_{i}^{m}" class="ltx_Math" display="inline" id="S3.SS2.SSS2.p2.8.m6.1"><semantics id="S3.SS2.SSS2.p2.8.m6.1a"><msubsup id="S3.SS2.SSS2.p2.8.m6.1.1" xref="S3.SS2.SSS2.p2.8.m6.1.1.cmml"><mi id="S3.SS2.SSS2.p2.8.m6.1.1.2.2" mathsize="90%" xref="S3.SS2.SSS2.p2.8.m6.1.1.2.2.cmml">s</mi><mi id="S3.SS2.SSS2.p2.8.m6.1.1.2.3" mathsize="90%" xref="S3.SS2.SSS2.p2.8.m6.1.1.2.3.cmml">i</mi><mi id="S3.SS2.SSS2.p2.8.m6.1.1.3" mathsize="90%" xref="S3.SS2.SSS2.p2.8.m6.1.1.3.cmml">m</mi></msubsup><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS2.p2.8.m6.1b"><apply id="S3.SS2.SSS2.p2.8.m6.1.1.cmml" xref="S3.SS2.SSS2.p2.8.m6.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS2.p2.8.m6.1.1.1.cmml" xref="S3.SS2.SSS2.p2.8.m6.1.1">superscript</csymbol><apply id="S3.SS2.SSS2.p2.8.m6.1.1.2.cmml" xref="S3.SS2.SSS2.p2.8.m6.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS2.p2.8.m6.1.1.2.1.cmml" xref="S3.SS2.SSS2.p2.8.m6.1.1">subscript</csymbol><ci id="S3.SS2.SSS2.p2.8.m6.1.1.2.2.cmml" xref="S3.SS2.SSS2.p2.8.m6.1.1.2.2">𝑠</ci><ci id="S3.SS2.SSS2.p2.8.m6.1.1.2.3.cmml" xref="S3.SS2.SSS2.p2.8.m6.1.1.2.3">𝑖</ci></apply><ci id="S3.SS2.SSS2.p2.8.m6.1.1.3.cmml" xref="S3.SS2.SSS2.p2.8.m6.1.1.3">𝑚</ci></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS2.p2.8.m6.1c">s_{i}^{m}</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS2.p2.8.m6.1d">italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS2.p2.12.7" style="font-size:90%;"> signifies the </span><math alttext="m" class="ltx_Math" display="inline" id="S3.SS2.SSS2.p2.9.m7.1"><semantics id="S3.SS2.SSS2.p2.9.m7.1a"><mi id="S3.SS2.SSS2.p2.9.m7.1.1" mathsize="90%" xref="S3.SS2.SSS2.p2.9.m7.1.1.cmml">m</mi><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS2.p2.9.m7.1b"><ci id="S3.SS2.SSS2.p2.9.m7.1.1.cmml" xref="S3.SS2.SSS2.p2.9.m7.1.1">𝑚</ci></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS2.p2.9.m7.1c">m</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS2.p2.9.m7.1d">italic_m</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS2.p2.12.8" style="font-size:90%;">-th training sample with the </span><math alttext="i" class="ltx_Math" display="inline" id="S3.SS2.SSS2.p2.10.m8.1"><semantics id="S3.SS2.SSS2.p2.10.m8.1a"><mi id="S3.SS2.SSS2.p2.10.m8.1.1" mathsize="90%" xref="S3.SS2.SSS2.p2.10.m8.1.1.cmml">i</mi><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS2.p2.10.m8.1b"><ci id="S3.SS2.SSS2.p2.10.m8.1.1.cmml" xref="S3.SS2.SSS2.p2.10.m8.1.1">𝑖</ci></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS2.p2.10.m8.1c">i</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS2.p2.10.m8.1d">italic_i</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS2.p2.12.9" style="font-size:90%;">-th speaker in the training dataset. Furthermore, </span><math alttext="\boldsymbol{\theta}_{spoof}" class="ltx_Math" display="inline" id="S3.SS2.SSS2.p2.11.m9.1"><semantics id="S3.SS2.SSS2.p2.11.m9.1a"><msub id="S3.SS2.SSS2.p2.11.m9.1.1" xref="S3.SS2.SSS2.p2.11.m9.1.1.cmml"><mi id="S3.SS2.SSS2.p2.11.m9.1.1.2" mathsize="90%" xref="S3.SS2.SSS2.p2.11.m9.1.1.2.cmml">𝜽</mi><mrow id="S3.SS2.SSS2.p2.11.m9.1.1.3" xref="S3.SS2.SSS2.p2.11.m9.1.1.3.cmml"><mi id="S3.SS2.SSS2.p2.11.m9.1.1.3.2" mathsize="90%" xref="S3.SS2.SSS2.p2.11.m9.1.1.3.2.cmml">s</mi><mo id="S3.SS2.SSS2.p2.11.m9.1.1.3.1" xref="S3.SS2.SSS2.p2.11.m9.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.11.m9.1.1.3.3" mathsize="90%" xref="S3.SS2.SSS2.p2.11.m9.1.1.3.3.cmml">p</mi><mo id="S3.SS2.SSS2.p2.11.m9.1.1.3.1a" xref="S3.SS2.SSS2.p2.11.m9.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.11.m9.1.1.3.4" mathsize="90%" xref="S3.SS2.SSS2.p2.11.m9.1.1.3.4.cmml">o</mi><mo id="S3.SS2.SSS2.p2.11.m9.1.1.3.1b" xref="S3.SS2.SSS2.p2.11.m9.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.11.m9.1.1.3.5" mathsize="90%" xref="S3.SS2.SSS2.p2.11.m9.1.1.3.5.cmml">o</mi><mo id="S3.SS2.SSS2.p2.11.m9.1.1.3.1c" xref="S3.SS2.SSS2.p2.11.m9.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.11.m9.1.1.3.6" mathsize="90%" xref="S3.SS2.SSS2.p2.11.m9.1.1.3.6.cmml">f</mi></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS2.p2.11.m9.1b"><apply id="S3.SS2.SSS2.p2.11.m9.1.1.cmml" xref="S3.SS2.SSS2.p2.11.m9.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS2.p2.11.m9.1.1.1.cmml" xref="S3.SS2.SSS2.p2.11.m9.1.1">subscript</csymbol><ci id="S3.SS2.SSS2.p2.11.m9.1.1.2.cmml" xref="S3.SS2.SSS2.p2.11.m9.1.1.2">𝜽</ci><apply id="S3.SS2.SSS2.p2.11.m9.1.1.3.cmml" xref="S3.SS2.SSS2.p2.11.m9.1.1.3"><times id="S3.SS2.SSS2.p2.11.m9.1.1.3.1.cmml" xref="S3.SS2.SSS2.p2.11.m9.1.1.3.1"></times><ci id="S3.SS2.SSS2.p2.11.m9.1.1.3.2.cmml" xref="S3.SS2.SSS2.p2.11.m9.1.1.3.2">𝑠</ci><ci id="S3.SS2.SSS2.p2.11.m9.1.1.3.3.cmml" xref="S3.SS2.SSS2.p2.11.m9.1.1.3.3">𝑝</ci><ci id="S3.SS2.SSS2.p2.11.m9.1.1.3.4.cmml" xref="S3.SS2.SSS2.p2.11.m9.1.1.3.4">𝑜</ci><ci id="S3.SS2.SSS2.p2.11.m9.1.1.3.5.cmml" xref="S3.SS2.SSS2.p2.11.m9.1.1.3.5">𝑜</ci><ci id="S3.SS2.SSS2.p2.11.m9.1.1.3.6.cmml" xref="S3.SS2.SSS2.p2.11.m9.1.1.3.6">𝑓</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS2.p2.11.m9.1c">\boldsymbol{\theta}_{spoof}</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS2.p2.11.m9.1d">bold_italic_θ start_POSTSUBSCRIPT italic_s italic_p italic_o italic_o italic_f end_POSTSUBSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS2.p2.12.10" style="font-size:90%;"> and </span><math alttext="\boldsymbol{\theta}_{asv}" class="ltx_Math" display="inline" id="S3.SS2.SSS2.p2.12.m10.1"><semantics id="S3.SS2.SSS2.p2.12.m10.1a"><msub id="S3.SS2.SSS2.p2.12.m10.1.1" xref="S3.SS2.SSS2.p2.12.m10.1.1.cmml"><mi id="S3.SS2.SSS2.p2.12.m10.1.1.2" mathsize="90%" xref="S3.SS2.SSS2.p2.12.m10.1.1.2.cmml">𝜽</mi><mrow id="S3.SS2.SSS2.p2.12.m10.1.1.3" xref="S3.SS2.SSS2.p2.12.m10.1.1.3.cmml"><mi id="S3.SS2.SSS2.p2.12.m10.1.1.3.2" mathsize="90%" xref="S3.SS2.SSS2.p2.12.m10.1.1.3.2.cmml">a</mi><mo id="S3.SS2.SSS2.p2.12.m10.1.1.3.1" xref="S3.SS2.SSS2.p2.12.m10.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.12.m10.1.1.3.3" mathsize="90%" xref="S3.SS2.SSS2.p2.12.m10.1.1.3.3.cmml">s</mi><mo id="S3.SS2.SSS2.p2.12.m10.1.1.3.1a" xref="S3.SS2.SSS2.p2.12.m10.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS2.p2.12.m10.1.1.3.4" mathsize="90%" xref="S3.SS2.SSS2.p2.12.m10.1.1.3.4.cmml">v</mi></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS2.p2.12.m10.1b"><apply id="S3.SS2.SSS2.p2.12.m10.1.1.cmml" xref="S3.SS2.SSS2.p2.12.m10.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS2.p2.12.m10.1.1.1.cmml" xref="S3.SS2.SSS2.p2.12.m10.1.1">subscript</csymbol><ci id="S3.SS2.SSS2.p2.12.m10.1.1.2.cmml" xref="S3.SS2.SSS2.p2.12.m10.1.1.2">𝜽</ci><apply id="S3.SS2.SSS2.p2.12.m10.1.1.3.cmml" xref="S3.SS2.SSS2.p2.12.m10.1.1.3"><times id="S3.SS2.SSS2.p2.12.m10.1.1.3.1.cmml" xref="S3.SS2.SSS2.p2.12.m10.1.1.3.1"></times><ci id="S3.SS2.SSS2.p2.12.m10.1.1.3.2.cmml" xref="S3.SS2.SSS2.p2.12.m10.1.1.3.2">𝑎</ci><ci id="S3.SS2.SSS2.p2.12.m10.1.1.3.3.cmml" xref="S3.SS2.SSS2.p2.12.m10.1.1.3.3">𝑠</ci><ci id="S3.SS2.SSS2.p2.12.m10.1.1.3.4.cmml" xref="S3.SS2.SSS2.p2.12.m10.1.1.3.4">𝑣</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS2.p2.12.m10.1c">\boldsymbol{\theta}_{asv}</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS2.p2.12.m10.1d">bold_italic_θ start_POSTSUBSCRIPT italic_a italic_s italic_v end_POSTSUBSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS2.p2.12.11" style="font-size:90%;"> correspond to the parameters of the output layers, respectively.</span></p> </div> </section> <section class="ltx_subsubsection" id="S3.SS2.SSS3"> <h4 class="ltx_title ltx_title_subsubsection" style="font-size:90%;"> <span class="ltx_tag ltx_tag_subsubsection">3.2.3 </span>SASV binary classifier</h4> <div class="ltx_para" id="S3.SS2.SSS3.p1"> <p class="ltx_p" id="S3.SS2.SSS3.p1.1"><span class="ltx_text" id="S3.SS2.SSS3.p1.1.1" style="font-size:90%;">For the SASV binary classifier, the initial step involves aggregating the outputs </span><math alttext="\boldsymbol{H}_{fuse}" class="ltx_Math" display="inline" id="S3.SS2.SSS3.p1.1.m1.1"><semantics id="S3.SS2.SSS3.p1.1.m1.1a"><msub id="S3.SS2.SSS3.p1.1.m1.1.1" xref="S3.SS2.SSS3.p1.1.m1.1.1.cmml"><mi id="S3.SS2.SSS3.p1.1.m1.1.1.2" mathsize="90%" xref="S3.SS2.SSS3.p1.1.m1.1.1.2.cmml">𝑯</mi><mrow id="S3.SS2.SSS3.p1.1.m1.1.1.3" xref="S3.SS2.SSS3.p1.1.m1.1.1.3.cmml"><mi id="S3.SS2.SSS3.p1.1.m1.1.1.3.2" mathsize="90%" xref="S3.SS2.SSS3.p1.1.m1.1.1.3.2.cmml">f</mi><mo id="S3.SS2.SSS3.p1.1.m1.1.1.3.1" xref="S3.SS2.SSS3.p1.1.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS3.p1.1.m1.1.1.3.3" mathsize="90%" xref="S3.SS2.SSS3.p1.1.m1.1.1.3.3.cmml">u</mi><mo id="S3.SS2.SSS3.p1.1.m1.1.1.3.1a" xref="S3.SS2.SSS3.p1.1.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS3.p1.1.m1.1.1.3.4" mathsize="90%" xref="S3.SS2.SSS3.p1.1.m1.1.1.3.4.cmml">s</mi><mo id="S3.SS2.SSS3.p1.1.m1.1.1.3.1b" xref="S3.SS2.SSS3.p1.1.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS3.p1.1.m1.1.1.3.5" mathsize="90%" xref="S3.SS2.SSS3.p1.1.m1.1.1.3.5.cmml">e</mi></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS3.p1.1.m1.1b"><apply id="S3.SS2.SSS3.p1.1.m1.1.1.cmml" xref="S3.SS2.SSS3.p1.1.m1.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS3.p1.1.m1.1.1.1.cmml" xref="S3.SS2.SSS3.p1.1.m1.1.1">subscript</csymbol><ci id="S3.SS2.SSS3.p1.1.m1.1.1.2.cmml" xref="S3.SS2.SSS3.p1.1.m1.1.1.2">𝑯</ci><apply id="S3.SS2.SSS3.p1.1.m1.1.1.3.cmml" xref="S3.SS2.SSS3.p1.1.m1.1.1.3"><times id="S3.SS2.SSS3.p1.1.m1.1.1.3.1.cmml" xref="S3.SS2.SSS3.p1.1.m1.1.1.3.1"></times><ci id="S3.SS2.SSS3.p1.1.m1.1.1.3.2.cmml" xref="S3.SS2.SSS3.p1.1.m1.1.1.3.2">𝑓</ci><ci id="S3.SS2.SSS3.p1.1.m1.1.1.3.3.cmml" xref="S3.SS2.SSS3.p1.1.m1.1.1.3.3">𝑢</ci><ci id="S3.SS2.SSS3.p1.1.m1.1.1.3.4.cmml" xref="S3.SS2.SSS3.p1.1.m1.1.1.3.4">𝑠</ci><ci id="S3.SS2.SSS3.p1.1.m1.1.1.3.5.cmml" xref="S3.SS2.SSS3.p1.1.m1.1.1.3.5">𝑒</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS3.p1.1.m1.1c">\boldsymbol{H}_{fuse}</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS3.p1.1.m1.1d">bold_italic_H start_POSTSUBSCRIPT italic_f italic_u italic_s italic_e end_POSTSUBSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS3.p1.1.2" style="font-size:90%;"> from each block by summation. Subsequently, layer normalization is applied to the aggregation, as expressed in the following equation:</span></p> <table class="ltx_equationgroup ltx_eqn_align ltx_eqn_table" id="S6.EGx5"> <tbody id="S3.E11"><tr class="ltx_equation ltx_eqn_row ltx_align_baseline"> <td class="ltx_eqn_cell ltx_eqn_center_padleft"></td> <td class="ltx_td ltx_align_right ltx_eqn_cell"><math alttext="\displaystyle\boldsymbol{H}_{fuse}" class="ltx_Math" display="inline" id="S3.E11.m1.1"><semantics id="S3.E11.m1.1a"><msub id="S3.E11.m1.1.1" xref="S3.E11.m1.1.1.cmml"><mi id="S3.E11.m1.1.1.2" mathsize="90%" xref="S3.E11.m1.1.1.2.cmml">𝑯</mi><mrow id="S3.E11.m1.1.1.3" xref="S3.E11.m1.1.1.3.cmml"><mi id="S3.E11.m1.1.1.3.2" mathsize="90%" xref="S3.E11.m1.1.1.3.2.cmml">f</mi><mo id="S3.E11.m1.1.1.3.1" xref="S3.E11.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.E11.m1.1.1.3.3" mathsize="90%" xref="S3.E11.m1.1.1.3.3.cmml">u</mi><mo id="S3.E11.m1.1.1.3.1a" xref="S3.E11.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.E11.m1.1.1.3.4" mathsize="90%" xref="S3.E11.m1.1.1.3.4.cmml">s</mi><mo id="S3.E11.m1.1.1.3.1b" xref="S3.E11.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.E11.m1.1.1.3.5" mathsize="90%" xref="S3.E11.m1.1.1.3.5.cmml">e</mi></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.E11.m1.1b"><apply id="S3.E11.m1.1.1.cmml" xref="S3.E11.m1.1.1"><csymbol cd="ambiguous" id="S3.E11.m1.1.1.1.cmml" xref="S3.E11.m1.1.1">subscript</csymbol><ci id="S3.E11.m1.1.1.2.cmml" xref="S3.E11.m1.1.1.2">𝑯</ci><apply id="S3.E11.m1.1.1.3.cmml" xref="S3.E11.m1.1.1.3"><times id="S3.E11.m1.1.1.3.1.cmml" xref="S3.E11.m1.1.1.3.1"></times><ci id="S3.E11.m1.1.1.3.2.cmml" xref="S3.E11.m1.1.1.3.2">𝑓</ci><ci id="S3.E11.m1.1.1.3.3.cmml" xref="S3.E11.m1.1.1.3.3">𝑢</ci><ci id="S3.E11.m1.1.1.3.4.cmml" xref="S3.E11.m1.1.1.3.4">𝑠</ci><ci id="S3.E11.m1.1.1.3.5.cmml" xref="S3.E11.m1.1.1.3.5">𝑒</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E11.m1.1c">\displaystyle\boldsymbol{H}_{fuse}</annotation><annotation encoding="application/x-llamapun" id="S3.E11.m1.1d">bold_italic_H start_POSTSUBSCRIPT italic_f italic_u italic_s italic_e end_POSTSUBSCRIPT</annotation></semantics></math></td> <td class="ltx_td ltx_align_left ltx_eqn_cell"><math alttext="\displaystyle=\text{LN}(\sum_{n=1}^{N}\boldsymbol{H}_{final}^{n})," class="ltx_Math" display="inline" id="S3.E11.m2.1"><semantics id="S3.E11.m2.1a"><mrow id="S3.E11.m2.1.1.1" xref="S3.E11.m2.1.1.1.1.cmml"><mrow id="S3.E11.m2.1.1.1.1" xref="S3.E11.m2.1.1.1.1.cmml"><mi id="S3.E11.m2.1.1.1.1.3" xref="S3.E11.m2.1.1.1.1.3.cmml"></mi><mo id="S3.E11.m2.1.1.1.1.2" mathsize="90%" xref="S3.E11.m2.1.1.1.1.2.cmml">=</mo><mrow id="S3.E11.m2.1.1.1.1.1" xref="S3.E11.m2.1.1.1.1.1.cmml"><mtext id="S3.E11.m2.1.1.1.1.1.3" mathsize="90%" xref="S3.E11.m2.1.1.1.1.1.3a.cmml">LN</mtext><mo id="S3.E11.m2.1.1.1.1.1.2" xref="S3.E11.m2.1.1.1.1.1.2.cmml">⁢</mo><mrow id="S3.E11.m2.1.1.1.1.1.1.1" xref="S3.E11.m2.1.1.1.1.1.1.1.1.cmml"><mo id="S3.E11.m2.1.1.1.1.1.1.1.2" maxsize="90%" minsize="90%" xref="S3.E11.m2.1.1.1.1.1.1.1.1.cmml">(</mo><mrow id="S3.E11.m2.1.1.1.1.1.1.1.1" xref="S3.E11.m2.1.1.1.1.1.1.1.1.cmml"><mstyle displaystyle="true" id="S3.E11.m2.1.1.1.1.1.1.1.1.1" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1.cmml"><munderover id="S3.E11.m2.1.1.1.1.1.1.1.1.1a" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1.cmml"><mo id="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.2" maxsize="90%" minsize="90%" movablelimits="false" stretchy="true" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.2.cmml">∑</mo><mrow id="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.3" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.3.cmml"><mi id="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.3.2" mathsize="90%" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.3.2.cmml">n</mi><mo id="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.3.1" mathsize="90%" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.3.1.cmml">=</mo><mn id="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.3.3" mathsize="90%" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.3.3.cmml">1</mn></mrow><mi id="S3.E11.m2.1.1.1.1.1.1.1.1.1.3" mathsize="90%" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1.3.cmml">N</mi></munderover></mstyle><msubsup id="S3.E11.m2.1.1.1.1.1.1.1.1.2" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.cmml"><mi id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.2" mathsize="90%" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.2.cmml">𝑯</mi><mrow id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.cmml"><mi id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.2" mathsize="90%" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.2.cmml">f</mi><mo id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.1" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.1.cmml">⁢</mo><mi id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.3" mathsize="90%" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.3.cmml">i</mi><mo id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.1a" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.1.cmml">⁢</mo><mi id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.4" mathsize="90%" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.4.cmml">n</mi><mo id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.1b" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.1.cmml">⁢</mo><mi id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.5" mathsize="90%" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.5.cmml">a</mi><mo id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.1c" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.1.cmml">⁢</mo><mi id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.6" mathsize="90%" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.6.cmml">l</mi></mrow><mi id="S3.E11.m2.1.1.1.1.1.1.1.1.2.3" mathsize="90%" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.3.cmml">n</mi></msubsup></mrow><mo id="S3.E11.m2.1.1.1.1.1.1.1.3" maxsize="90%" minsize="90%" xref="S3.E11.m2.1.1.1.1.1.1.1.1.cmml">)</mo></mrow></mrow></mrow><mo id="S3.E11.m2.1.1.1.2" mathsize="90%" xref="S3.E11.m2.1.1.1.1.cmml">,</mo></mrow><annotation-xml encoding="MathML-Content" id="S3.E11.m2.1b"><apply id="S3.E11.m2.1.1.1.1.cmml" xref="S3.E11.m2.1.1.1"><eq id="S3.E11.m2.1.1.1.1.2.cmml" xref="S3.E11.m2.1.1.1.1.2"></eq><csymbol cd="latexml" id="S3.E11.m2.1.1.1.1.3.cmml" xref="S3.E11.m2.1.1.1.1.3">absent</csymbol><apply id="S3.E11.m2.1.1.1.1.1.cmml" xref="S3.E11.m2.1.1.1.1.1"><times id="S3.E11.m2.1.1.1.1.1.2.cmml" xref="S3.E11.m2.1.1.1.1.1.2"></times><ci id="S3.E11.m2.1.1.1.1.1.3a.cmml" xref="S3.E11.m2.1.1.1.1.1.3"><mtext id="S3.E11.m2.1.1.1.1.1.3.cmml" mathsize="90%" xref="S3.E11.m2.1.1.1.1.1.3">LN</mtext></ci><apply id="S3.E11.m2.1.1.1.1.1.1.1.1.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1"><apply id="S3.E11.m2.1.1.1.1.1.1.1.1.1.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1"><csymbol cd="ambiguous" id="S3.E11.m2.1.1.1.1.1.1.1.1.1.1.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1">superscript</csymbol><apply id="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1"><csymbol cd="ambiguous" id="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.1.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1">subscript</csymbol><sum id="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.2.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.2"></sum><apply id="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.3.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.3"><eq id="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.3.1.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.3.1"></eq><ci id="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.3.2.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.3.2">𝑛</ci><cn id="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.3.3.cmml" type="integer" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1.2.3.3">1</cn></apply></apply><ci id="S3.E11.m2.1.1.1.1.1.1.1.1.1.3.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.1.3">𝑁</ci></apply><apply id="S3.E11.m2.1.1.1.1.1.1.1.1.2.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2"><csymbol cd="ambiguous" id="S3.E11.m2.1.1.1.1.1.1.1.1.2.1.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2">superscript</csymbol><apply id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2"><csymbol cd="ambiguous" id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.1.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2">subscript</csymbol><ci id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.2.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.2">𝑯</ci><apply id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3"><times id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.1.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.1"></times><ci id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.2.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.2">𝑓</ci><ci id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.3.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.3">𝑖</ci><ci id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.4.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.4">𝑛</ci><ci id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.5.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.5">𝑎</ci><ci id="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.6.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.2.3.6">𝑙</ci></apply></apply><ci id="S3.E11.m2.1.1.1.1.1.1.1.1.2.3.cmml" xref="S3.E11.m2.1.1.1.1.1.1.1.1.2.3">𝑛</ci></apply></apply></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E11.m2.1c">\displaystyle=\text{LN}(\sum_{n=1}^{N}\boldsymbol{H}_{final}^{n}),</annotation><annotation encoding="application/x-llamapun" id="S3.E11.m2.1d">= LN ( ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT bold_italic_H start_POSTSUBSCRIPT italic_f italic_i italic_n italic_a italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ) ,</annotation></semantics></math></td> <td class="ltx_eqn_cell ltx_eqn_center_padright"></td> <td class="ltx_eqn_cell ltx_eqn_eqno ltx_align_middle ltx_align_right" rowspan="1"><span class="ltx_tag ltx_tag_equation ltx_align_right">(11)</span></td> </tr></tbody> </table> <p class="ltx_p" id="S3.SS2.SSS3.p1.3"><span class="ltx_text" id="S3.SS2.SSS3.p1.3.1" style="font-size:90%;">Here, </span><math alttext="\boldsymbol{H}_{fuse}" class="ltx_Math" display="inline" id="S3.SS2.SSS3.p1.2.m1.1"><semantics id="S3.SS2.SSS3.p1.2.m1.1a"><msub id="S3.SS2.SSS3.p1.2.m1.1.1" xref="S3.SS2.SSS3.p1.2.m1.1.1.cmml"><mi id="S3.SS2.SSS3.p1.2.m1.1.1.2" mathsize="90%" xref="S3.SS2.SSS3.p1.2.m1.1.1.2.cmml">𝑯</mi><mrow id="S3.SS2.SSS3.p1.2.m1.1.1.3" xref="S3.SS2.SSS3.p1.2.m1.1.1.3.cmml"><mi id="S3.SS2.SSS3.p1.2.m1.1.1.3.2" mathsize="90%" xref="S3.SS2.SSS3.p1.2.m1.1.1.3.2.cmml">f</mi><mo id="S3.SS2.SSS3.p1.2.m1.1.1.3.1" xref="S3.SS2.SSS3.p1.2.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS3.p1.2.m1.1.1.3.3" mathsize="90%" xref="S3.SS2.SSS3.p1.2.m1.1.1.3.3.cmml">u</mi><mo id="S3.SS2.SSS3.p1.2.m1.1.1.3.1a" xref="S3.SS2.SSS3.p1.2.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS3.p1.2.m1.1.1.3.4" mathsize="90%" xref="S3.SS2.SSS3.p1.2.m1.1.1.3.4.cmml">s</mi><mo id="S3.SS2.SSS3.p1.2.m1.1.1.3.1b" xref="S3.SS2.SSS3.p1.2.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS3.p1.2.m1.1.1.3.5" mathsize="90%" xref="S3.SS2.SSS3.p1.2.m1.1.1.3.5.cmml">e</mi></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS3.p1.2.m1.1b"><apply id="S3.SS2.SSS3.p1.2.m1.1.1.cmml" xref="S3.SS2.SSS3.p1.2.m1.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS3.p1.2.m1.1.1.1.cmml" xref="S3.SS2.SSS3.p1.2.m1.1.1">subscript</csymbol><ci id="S3.SS2.SSS3.p1.2.m1.1.1.2.cmml" xref="S3.SS2.SSS3.p1.2.m1.1.1.2">𝑯</ci><apply id="S3.SS2.SSS3.p1.2.m1.1.1.3.cmml" xref="S3.SS2.SSS3.p1.2.m1.1.1.3"><times id="S3.SS2.SSS3.p1.2.m1.1.1.3.1.cmml" xref="S3.SS2.SSS3.p1.2.m1.1.1.3.1"></times><ci id="S3.SS2.SSS3.p1.2.m1.1.1.3.2.cmml" xref="S3.SS2.SSS3.p1.2.m1.1.1.3.2">𝑓</ci><ci id="S3.SS2.SSS3.p1.2.m1.1.1.3.3.cmml" xref="S3.SS2.SSS3.p1.2.m1.1.1.3.3">𝑢</ci><ci id="S3.SS2.SSS3.p1.2.m1.1.1.3.4.cmml" xref="S3.SS2.SSS3.p1.2.m1.1.1.3.4">𝑠</ci><ci id="S3.SS2.SSS3.p1.2.m1.1.1.3.5.cmml" xref="S3.SS2.SSS3.p1.2.m1.1.1.3.5">𝑒</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS3.p1.2.m1.1c">\boldsymbol{H}_{fuse}</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS3.p1.2.m1.1d">bold_italic_H start_POSTSUBSCRIPT italic_f italic_u italic_s italic_e end_POSTSUBSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS3.p1.3.2" style="font-size:90%;"> represents the output of the aggregation. Following this, the aggregated features undergo transformation through a one-dimensional convolutional layer and a pooling layer, generating an embedding denoted as </span><math alttext="\boldsymbol{e}_{fuse}" class="ltx_Math" display="inline" id="S3.SS2.SSS3.p1.3.m2.1"><semantics id="S3.SS2.SSS3.p1.3.m2.1a"><msub id="S3.SS2.SSS3.p1.3.m2.1.1" xref="S3.SS2.SSS3.p1.3.m2.1.1.cmml"><mi id="S3.SS2.SSS3.p1.3.m2.1.1.2" mathsize="90%" xref="S3.SS2.SSS3.p1.3.m2.1.1.2.cmml">𝒆</mi><mrow id="S3.SS2.SSS3.p1.3.m2.1.1.3" xref="S3.SS2.SSS3.p1.3.m2.1.1.3.cmml"><mi id="S3.SS2.SSS3.p1.3.m2.1.1.3.2" mathsize="90%" xref="S3.SS2.SSS3.p1.3.m2.1.1.3.2.cmml">f</mi><mo id="S3.SS2.SSS3.p1.3.m2.1.1.3.1" xref="S3.SS2.SSS3.p1.3.m2.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS3.p1.3.m2.1.1.3.3" mathsize="90%" xref="S3.SS2.SSS3.p1.3.m2.1.1.3.3.cmml">u</mi><mo id="S3.SS2.SSS3.p1.3.m2.1.1.3.1a" xref="S3.SS2.SSS3.p1.3.m2.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS3.p1.3.m2.1.1.3.4" mathsize="90%" xref="S3.SS2.SSS3.p1.3.m2.1.1.3.4.cmml">s</mi><mo id="S3.SS2.SSS3.p1.3.m2.1.1.3.1b" xref="S3.SS2.SSS3.p1.3.m2.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS3.p1.3.m2.1.1.3.5" mathsize="90%" xref="S3.SS2.SSS3.p1.3.m2.1.1.3.5.cmml">e</mi></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS3.p1.3.m2.1b"><apply id="S3.SS2.SSS3.p1.3.m2.1.1.cmml" xref="S3.SS2.SSS3.p1.3.m2.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS3.p1.3.m2.1.1.1.cmml" xref="S3.SS2.SSS3.p1.3.m2.1.1">subscript</csymbol><ci id="S3.SS2.SSS3.p1.3.m2.1.1.2.cmml" xref="S3.SS2.SSS3.p1.3.m2.1.1.2">𝒆</ci><apply id="S3.SS2.SSS3.p1.3.m2.1.1.3.cmml" xref="S3.SS2.SSS3.p1.3.m2.1.1.3"><times id="S3.SS2.SSS3.p1.3.m2.1.1.3.1.cmml" xref="S3.SS2.SSS3.p1.3.m2.1.1.3.1"></times><ci id="S3.SS2.SSS3.p1.3.m2.1.1.3.2.cmml" xref="S3.SS2.SSS3.p1.3.m2.1.1.3.2">𝑓</ci><ci id="S3.SS2.SSS3.p1.3.m2.1.1.3.3.cmml" xref="S3.SS2.SSS3.p1.3.m2.1.1.3.3">𝑢</ci><ci id="S3.SS2.SSS3.p1.3.m2.1.1.3.4.cmml" xref="S3.SS2.SSS3.p1.3.m2.1.1.3.4">𝑠</ci><ci id="S3.SS2.SSS3.p1.3.m2.1.1.3.5.cmml" xref="S3.SS2.SSS3.p1.3.m2.1.1.3.5">𝑒</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS3.p1.3.m2.1c">\boldsymbol{e}_{fuse}</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS3.p1.3.m2.1d">bold_italic_e start_POSTSUBSCRIPT italic_f italic_u italic_s italic_e end_POSTSUBSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS3.p1.3.3" style="font-size:90%;">.</span></p> </div> <div class="ltx_para" id="S3.SS2.SSS3.p2"> <p class="ltx_p" id="S3.SS2.SSS3.p2.1"><span class="ltx_text" id="S3.SS2.SSS3.p2.1.1" style="font-size:90%;">The architecture employed for the sub-module labeled “SASV back-end” in Figure </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.F2" style="font-size:90%;" title="Figure 2 ‣ 3.1 Overview ‣ 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">2</span></a><span class="ltx_text" id="S3.SS2.SSS3.p2.1.2" style="font-size:90%;"> is identical to that outlined in </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S3.SS2.SSS3.p2.1.3.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib27" title="">27</a><span class="ltx_text" id="S3.SS2.SSS3.p2.1.4.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S3.SS2.SSS3.p2.1.5" style="font-size:90%;">. However, our approach is distinct due to the assertion that the informative embedding </span><math alttext="\boldsymbol{e}_{fuse}" class="ltx_Math" display="inline" id="S3.SS2.SSS3.p2.1.m1.1"><semantics id="S3.SS2.SSS3.p2.1.m1.1a"><msub id="S3.SS2.SSS3.p2.1.m1.1.1" xref="S3.SS2.SSS3.p2.1.m1.1.1.cmml"><mi id="S3.SS2.SSS3.p2.1.m1.1.1.2" mathsize="90%" xref="S3.SS2.SSS3.p2.1.m1.1.1.2.cmml">𝒆</mi><mrow id="S3.SS2.SSS3.p2.1.m1.1.1.3" xref="S3.SS2.SSS3.p2.1.m1.1.1.3.cmml"><mi id="S3.SS2.SSS3.p2.1.m1.1.1.3.2" mathsize="90%" xref="S3.SS2.SSS3.p2.1.m1.1.1.3.2.cmml">f</mi><mo id="S3.SS2.SSS3.p2.1.m1.1.1.3.1" xref="S3.SS2.SSS3.p2.1.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS3.p2.1.m1.1.1.3.3" mathsize="90%" xref="S3.SS2.SSS3.p2.1.m1.1.1.3.3.cmml">u</mi><mo id="S3.SS2.SSS3.p2.1.m1.1.1.3.1a" xref="S3.SS2.SSS3.p2.1.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS3.p2.1.m1.1.1.3.4" mathsize="90%" xref="S3.SS2.SSS3.p2.1.m1.1.1.3.4.cmml">s</mi><mo id="S3.SS2.SSS3.p2.1.m1.1.1.3.1b" xref="S3.SS2.SSS3.p2.1.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS2.SSS3.p2.1.m1.1.1.3.5" mathsize="90%" xref="S3.SS2.SSS3.p2.1.m1.1.1.3.5.cmml">e</mi></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.SS2.SSS3.p2.1.m1.1b"><apply id="S3.SS2.SSS3.p2.1.m1.1.1.cmml" xref="S3.SS2.SSS3.p2.1.m1.1.1"><csymbol cd="ambiguous" id="S3.SS2.SSS3.p2.1.m1.1.1.1.cmml" xref="S3.SS2.SSS3.p2.1.m1.1.1">subscript</csymbol><ci id="S3.SS2.SSS3.p2.1.m1.1.1.2.cmml" xref="S3.SS2.SSS3.p2.1.m1.1.1.2">𝒆</ci><apply id="S3.SS2.SSS3.p2.1.m1.1.1.3.cmml" xref="S3.SS2.SSS3.p2.1.m1.1.1.3"><times id="S3.SS2.SSS3.p2.1.m1.1.1.3.1.cmml" xref="S3.SS2.SSS3.p2.1.m1.1.1.3.1"></times><ci id="S3.SS2.SSS3.p2.1.m1.1.1.3.2.cmml" xref="S3.SS2.SSS3.p2.1.m1.1.1.3.2">𝑓</ci><ci id="S3.SS2.SSS3.p2.1.m1.1.1.3.3.cmml" xref="S3.SS2.SSS3.p2.1.m1.1.1.3.3">𝑢</ci><ci id="S3.SS2.SSS3.p2.1.m1.1.1.3.4.cmml" xref="S3.SS2.SSS3.p2.1.m1.1.1.3.4">𝑠</ci><ci id="S3.SS2.SSS3.p2.1.m1.1.1.3.5.cmml" xref="S3.SS2.SSS3.p2.1.m1.1.1.3.5">𝑒</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS2.SSS3.p2.1.m1.1c">\boldsymbol{e}_{fuse}</annotation><annotation encoding="application/x-llamapun" id="S3.SS2.SSS3.p2.1.m1.1d">bold_italic_e start_POSTSUBSCRIPT italic_f italic_u italic_s italic_e end_POSTSUBSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS2.SSS3.p2.1.6" style="font-size:90%;"> can serve both the anti-spoofing and ASV tasks. Consequently, this embedding is utilized for both CM decision and ASV decision, which diverges from the input used in the SASV back-end, as detailed in </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S3.SS2.SSS3.p2.1.7.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib27" title="">27</a><span class="ltx_text" id="S3.SS2.SSS3.p2.1.8.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S3.SS2.SSS3.p2.1.9" style="font-size:90%;">.</span></p> </div> </section> </section> <section class="ltx_subsection" id="S3.SS3"> <h3 class="ltx_title ltx_title_subsection" style="font-size:90%;"> <span class="ltx_tag ltx_tag_subsection">3.3 </span>Learning paradigm</h3> <div class="ltx_para" id="S3.SS3.p1"> <p class="ltx_p" id="S3.SS3.p1.9"><span class="ltx_text" id="S3.SS3.p1.9.1" style="font-size:90%;">To confer resilience upon the proposed model, we integrate the pair-wise learning paradigm, simulation of spoofing attacks, and meta-learning paradigm. As elucidated in </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S3.SS3.p1.9.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib31" title="">31</a><span class="ltx_text" id="S3.SS3.p1.9.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S3.SS3.p1.9.4" style="font-size:90%;">, meta-learning can be understood as a process of bilevel optimization involving a hierarchical optimization problem where one optimization contains another as a constraint </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S3.SS3.p1.9.5.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib45" title="">45</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib46" title="">46</a><span class="ltx_text" id="S3.SS3.p1.9.6.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S3.SS3.p1.9.7" style="font-size:90%;">. This bilevel optimization is divided into two loops, the outer and inner, which effectively simulate domain mismatch during the training of machine learning models. It can be formalized as follows:</span></p> <table class="ltx_equationgroup ltx_eqn_align ltx_eqn_table" id="S6.EGx6"> <tbody id="S3.Ex2"><tr class="ltx_equation ltx_eqn_row ltx_align_baseline"> <td class="ltx_eqn_cell ltx_eqn_center_padleft"></td> <td class="ltx_td ltx_align_right ltx_eqn_cell"><math alttext="\displaystyle\theta_{i+1}" class="ltx_Math" display="inline" id="S3.Ex2.m1.1"><semantics id="S3.Ex2.m1.1a"><msub id="S3.Ex2.m1.1.1" xref="S3.Ex2.m1.1.1.cmml"><mi id="S3.Ex2.m1.1.1.2" mathsize="90%" xref="S3.Ex2.m1.1.1.2.cmml">θ</mi><mrow id="S3.Ex2.m1.1.1.3" xref="S3.Ex2.m1.1.1.3.cmml"><mi id="S3.Ex2.m1.1.1.3.2" mathsize="90%" xref="S3.Ex2.m1.1.1.3.2.cmml">i</mi><mo id="S3.Ex2.m1.1.1.3.1" mathsize="90%" xref="S3.Ex2.m1.1.1.3.1.cmml">+</mo><mn id="S3.Ex2.m1.1.1.3.3" mathsize="90%" xref="S3.Ex2.m1.1.1.3.3.cmml">1</mn></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.Ex2.m1.1b"><apply id="S3.Ex2.m1.1.1.cmml" xref="S3.Ex2.m1.1.1"><csymbol cd="ambiguous" id="S3.Ex2.m1.1.1.1.cmml" xref="S3.Ex2.m1.1.1">subscript</csymbol><ci id="S3.Ex2.m1.1.1.2.cmml" xref="S3.Ex2.m1.1.1.2">𝜃</ci><apply id="S3.Ex2.m1.1.1.3.cmml" xref="S3.Ex2.m1.1.1.3"><plus id="S3.Ex2.m1.1.1.3.1.cmml" xref="S3.Ex2.m1.1.1.3.1"></plus><ci id="S3.Ex2.m1.1.1.3.2.cmml" xref="S3.Ex2.m1.1.1.3.2">𝑖</ci><cn id="S3.Ex2.m1.1.1.3.3.cmml" type="integer" xref="S3.Ex2.m1.1.1.3.3">1</cn></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.Ex2.m1.1c">\displaystyle\theta_{i+1}</annotation><annotation encoding="application/x-llamapun" id="S3.Ex2.m1.1d">italic_θ start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT</annotation></semantics></math></td> <td class="ltx_td ltx_align_left ltx_eqn_cell"><math alttext="\displaystyle=\operatorname*{argmin}_{\theta}\sum_{i}^{T}\mathcal{L}^{outer}(% \theta_{i}^{*},\mathcal{D}_{source}^{meta-test(i)})" class="ltx_Math" display="inline" id="S3.Ex2.m2.3"><semantics id="S3.Ex2.m2.3a"><mrow id="S3.Ex2.m2.3.3" xref="S3.Ex2.m2.3.3.cmml"><mi id="S3.Ex2.m2.3.3.4" xref="S3.Ex2.m2.3.3.4.cmml"></mi><mo id="S3.Ex2.m2.3.3.3" mathsize="90%" rspace="0.1389em" xref="S3.Ex2.m2.3.3.3.cmml">=</mo><mrow id="S3.Ex2.m2.3.3.2" xref="S3.Ex2.m2.3.3.2.cmml"><munder id="S3.Ex2.m2.3.3.2.4" xref="S3.Ex2.m2.3.3.2.4.cmml"><mo id="S3.Ex2.m2.3.3.2.4.2" lspace="0.1389em" mathsize="90%" xref="S3.Ex2.m2.3.3.2.4.2.cmml">argmin</mo><mi id="S3.Ex2.m2.3.3.2.4.3" mathsize="90%" xref="S3.Ex2.m2.3.3.2.4.3.cmml">θ</mi></munder><mo id="S3.Ex2.m2.3.3.2.3" lspace="0.167em" xref="S3.Ex2.m2.3.3.2.3.cmml">⁢</mo><mrow id="S3.Ex2.m2.3.3.2.2" xref="S3.Ex2.m2.3.3.2.2.cmml"><mstyle displaystyle="true" id="S3.Ex2.m2.3.3.2.2.3" xref="S3.Ex2.m2.3.3.2.2.3.cmml"><munderover id="S3.Ex2.m2.3.3.2.2.3a" xref="S3.Ex2.m2.3.3.2.2.3.cmml"><mo id="S3.Ex2.m2.3.3.2.2.3.2.2" maxsize="90%" minsize="90%" movablelimits="false" stretchy="true" xref="S3.Ex2.m2.3.3.2.2.3.2.2.cmml">∑</mo><mi id="S3.Ex2.m2.3.3.2.2.3.2.3" mathsize="90%" xref="S3.Ex2.m2.3.3.2.2.3.2.3.cmml">i</mi><mi id="S3.Ex2.m2.3.3.2.2.3.3" mathsize="90%" xref="S3.Ex2.m2.3.3.2.2.3.3.cmml">T</mi></munderover></mstyle><mrow id="S3.Ex2.m2.3.3.2.2.2" xref="S3.Ex2.m2.3.3.2.2.2.cmml"><msup id="S3.Ex2.m2.3.3.2.2.2.4" xref="S3.Ex2.m2.3.3.2.2.2.4.cmml"><mi class="ltx_font_mathcaligraphic" id="S3.Ex2.m2.3.3.2.2.2.4.2" mathsize="90%" xref="S3.Ex2.m2.3.3.2.2.2.4.2.cmml">ℒ</mi><mrow id="S3.Ex2.m2.3.3.2.2.2.4.3" xref="S3.Ex2.m2.3.3.2.2.2.4.3.cmml"><mi id="S3.Ex2.m2.3.3.2.2.2.4.3.2" mathsize="90%" xref="S3.Ex2.m2.3.3.2.2.2.4.3.2.cmml">o</mi><mo id="S3.Ex2.m2.3.3.2.2.2.4.3.1" xref="S3.Ex2.m2.3.3.2.2.2.4.3.1.cmml">⁢</mo><mi id="S3.Ex2.m2.3.3.2.2.2.4.3.3" mathsize="90%" xref="S3.Ex2.m2.3.3.2.2.2.4.3.3.cmml">u</mi><mo id="S3.Ex2.m2.3.3.2.2.2.4.3.1a" xref="S3.Ex2.m2.3.3.2.2.2.4.3.1.cmml">⁢</mo><mi id="S3.Ex2.m2.3.3.2.2.2.4.3.4" mathsize="90%" xref="S3.Ex2.m2.3.3.2.2.2.4.3.4.cmml">t</mi><mo id="S3.Ex2.m2.3.3.2.2.2.4.3.1b" xref="S3.Ex2.m2.3.3.2.2.2.4.3.1.cmml">⁢</mo><mi id="S3.Ex2.m2.3.3.2.2.2.4.3.5" mathsize="90%" xref="S3.Ex2.m2.3.3.2.2.2.4.3.5.cmml">e</mi><mo id="S3.Ex2.m2.3.3.2.2.2.4.3.1c" xref="S3.Ex2.m2.3.3.2.2.2.4.3.1.cmml">⁢</mo><mi id="S3.Ex2.m2.3.3.2.2.2.4.3.6" mathsize="90%" xref="S3.Ex2.m2.3.3.2.2.2.4.3.6.cmml">r</mi></mrow></msup><mo id="S3.Ex2.m2.3.3.2.2.2.3" xref="S3.Ex2.m2.3.3.2.2.2.3.cmml">⁢</mo><mrow id="S3.Ex2.m2.3.3.2.2.2.2.2" xref="S3.Ex2.m2.3.3.2.2.2.2.3.cmml"><mo id="S3.Ex2.m2.3.3.2.2.2.2.2.3" maxsize="90%" minsize="90%" xref="S3.Ex2.m2.3.3.2.2.2.2.3.cmml">(</mo><msubsup id="S3.Ex2.m2.2.2.1.1.1.1.1.1" xref="S3.Ex2.m2.2.2.1.1.1.1.1.1.cmml"><mi id="S3.Ex2.m2.2.2.1.1.1.1.1.1.2.2" mathsize="90%" xref="S3.Ex2.m2.2.2.1.1.1.1.1.1.2.2.cmml">θ</mi><mi id="S3.Ex2.m2.2.2.1.1.1.1.1.1.2.3" mathsize="90%" xref="S3.Ex2.m2.2.2.1.1.1.1.1.1.2.3.cmml">i</mi><mo id="S3.Ex2.m2.2.2.1.1.1.1.1.1.3" mathsize="90%" xref="S3.Ex2.m2.2.2.1.1.1.1.1.1.3.cmml">∗</mo></msubsup><mo id="S3.Ex2.m2.3.3.2.2.2.2.2.4" mathsize="90%" xref="S3.Ex2.m2.3.3.2.2.2.2.3.cmml">,</mo><msubsup id="S3.Ex2.m2.3.3.2.2.2.2.2.2" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.cmml"><mi class="ltx_font_mathcaligraphic" id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.2" mathsize="90%" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.2.cmml">𝒟</mi><mrow id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.cmml"><mi id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.2" mathsize="90%" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.2.cmml">s</mi><mo id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.1" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.1.cmml">⁢</mo><mi id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.3" mathsize="90%" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.3.cmml">o</mi><mo id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.1a" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.1.cmml">⁢</mo><mi id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.4" mathsize="90%" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.4.cmml">u</mi><mo id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.1b" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.1.cmml">⁢</mo><mi id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.5" mathsize="90%" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.5.cmml">r</mi><mo id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.1c" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.1.cmml">⁢</mo><mi id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.6" mathsize="90%" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.6.cmml">c</mi><mo id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.1d" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.1.cmml">⁢</mo><mi id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.7" mathsize="90%" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.7.cmml">e</mi></mrow><mrow id="S3.Ex2.m2.1.1.1" xref="S3.Ex2.m2.1.1.1.cmml"><mrow id="S3.Ex2.m2.1.1.1.3" xref="S3.Ex2.m2.1.1.1.3.cmml"><mi id="S3.Ex2.m2.1.1.1.3.2" mathsize="90%" xref="S3.Ex2.m2.1.1.1.3.2.cmml">m</mi><mo id="S3.Ex2.m2.1.1.1.3.1" xref="S3.Ex2.m2.1.1.1.3.1.cmml">⁢</mo><mi id="S3.Ex2.m2.1.1.1.3.3" mathsize="90%" xref="S3.Ex2.m2.1.1.1.3.3.cmml">e</mi><mo id="S3.Ex2.m2.1.1.1.3.1a" xref="S3.Ex2.m2.1.1.1.3.1.cmml">⁢</mo><mi id="S3.Ex2.m2.1.1.1.3.4" mathsize="90%" xref="S3.Ex2.m2.1.1.1.3.4.cmml">t</mi><mo id="S3.Ex2.m2.1.1.1.3.1b" xref="S3.Ex2.m2.1.1.1.3.1.cmml">⁢</mo><mi id="S3.Ex2.m2.1.1.1.3.5" mathsize="90%" xref="S3.Ex2.m2.1.1.1.3.5.cmml">a</mi></mrow><mo id="S3.Ex2.m2.1.1.1.2" mathsize="90%" xref="S3.Ex2.m2.1.1.1.2.cmml">−</mo><mrow id="S3.Ex2.m2.1.1.1.4" xref="S3.Ex2.m2.1.1.1.4.cmml"><mi id="S3.Ex2.m2.1.1.1.4.2" mathsize="90%" xref="S3.Ex2.m2.1.1.1.4.2.cmml">t</mi><mo id="S3.Ex2.m2.1.1.1.4.1" xref="S3.Ex2.m2.1.1.1.4.1.cmml">⁢</mo><mi id="S3.Ex2.m2.1.1.1.4.3" mathsize="90%" xref="S3.Ex2.m2.1.1.1.4.3.cmml">e</mi><mo id="S3.Ex2.m2.1.1.1.4.1a" xref="S3.Ex2.m2.1.1.1.4.1.cmml">⁢</mo><mi id="S3.Ex2.m2.1.1.1.4.4" mathsize="90%" xref="S3.Ex2.m2.1.1.1.4.4.cmml">s</mi><mo id="S3.Ex2.m2.1.1.1.4.1b" xref="S3.Ex2.m2.1.1.1.4.1.cmml">⁢</mo><mi id="S3.Ex2.m2.1.1.1.4.5" mathsize="90%" xref="S3.Ex2.m2.1.1.1.4.5.cmml">t</mi><mo id="S3.Ex2.m2.1.1.1.4.1c" xref="S3.Ex2.m2.1.1.1.4.1.cmml">⁢</mo><mrow id="S3.Ex2.m2.1.1.1.4.6.2" xref="S3.Ex2.m2.1.1.1.4.cmml"><mo id="S3.Ex2.m2.1.1.1.4.6.2.1" maxsize="90%" minsize="90%" xref="S3.Ex2.m2.1.1.1.4.cmml">(</mo><mi id="S3.Ex2.m2.1.1.1.1" mathsize="90%" xref="S3.Ex2.m2.1.1.1.1.cmml">i</mi><mo id="S3.Ex2.m2.1.1.1.4.6.2.2" maxsize="90%" minsize="90%" xref="S3.Ex2.m2.1.1.1.4.cmml">)</mo></mrow></mrow></mrow></msubsup><mo id="S3.Ex2.m2.3.3.2.2.2.2.2.5" maxsize="90%" minsize="90%" xref="S3.Ex2.m2.3.3.2.2.2.2.3.cmml">)</mo></mrow></mrow></mrow></mrow></mrow><annotation-xml encoding="MathML-Content" id="S3.Ex2.m2.3b"><apply id="S3.Ex2.m2.3.3.cmml" xref="S3.Ex2.m2.3.3"><eq id="S3.Ex2.m2.3.3.3.cmml" xref="S3.Ex2.m2.3.3.3"></eq><csymbol cd="latexml" id="S3.Ex2.m2.3.3.4.cmml" xref="S3.Ex2.m2.3.3.4">absent</csymbol><apply id="S3.Ex2.m2.3.3.2.cmml" xref="S3.Ex2.m2.3.3.2"><times id="S3.Ex2.m2.3.3.2.3.cmml" xref="S3.Ex2.m2.3.3.2.3"></times><apply id="S3.Ex2.m2.3.3.2.4.cmml" xref="S3.Ex2.m2.3.3.2.4"><csymbol cd="ambiguous" id="S3.Ex2.m2.3.3.2.4.1.cmml" xref="S3.Ex2.m2.3.3.2.4">subscript</csymbol><ci id="S3.Ex2.m2.3.3.2.4.2.cmml" xref="S3.Ex2.m2.3.3.2.4.2">argmin</ci><ci id="S3.Ex2.m2.3.3.2.4.3.cmml" xref="S3.Ex2.m2.3.3.2.4.3">𝜃</ci></apply><apply id="S3.Ex2.m2.3.3.2.2.cmml" xref="S3.Ex2.m2.3.3.2.2"><apply id="S3.Ex2.m2.3.3.2.2.3.cmml" xref="S3.Ex2.m2.3.3.2.2.3"><csymbol cd="ambiguous" id="S3.Ex2.m2.3.3.2.2.3.1.cmml" xref="S3.Ex2.m2.3.3.2.2.3">superscript</csymbol><apply id="S3.Ex2.m2.3.3.2.2.3.2.cmml" xref="S3.Ex2.m2.3.3.2.2.3"><csymbol cd="ambiguous" id="S3.Ex2.m2.3.3.2.2.3.2.1.cmml" xref="S3.Ex2.m2.3.3.2.2.3">subscript</csymbol><sum id="S3.Ex2.m2.3.3.2.2.3.2.2.cmml" xref="S3.Ex2.m2.3.3.2.2.3.2.2"></sum><ci id="S3.Ex2.m2.3.3.2.2.3.2.3.cmml" xref="S3.Ex2.m2.3.3.2.2.3.2.3">𝑖</ci></apply><ci id="S3.Ex2.m2.3.3.2.2.3.3.cmml" xref="S3.Ex2.m2.3.3.2.2.3.3">𝑇</ci></apply><apply id="S3.Ex2.m2.3.3.2.2.2.cmml" xref="S3.Ex2.m2.3.3.2.2.2"><times id="S3.Ex2.m2.3.3.2.2.2.3.cmml" xref="S3.Ex2.m2.3.3.2.2.2.3"></times><apply id="S3.Ex2.m2.3.3.2.2.2.4.cmml" xref="S3.Ex2.m2.3.3.2.2.2.4"><csymbol cd="ambiguous" id="S3.Ex2.m2.3.3.2.2.2.4.1.cmml" xref="S3.Ex2.m2.3.3.2.2.2.4">superscript</csymbol><ci id="S3.Ex2.m2.3.3.2.2.2.4.2.cmml" xref="S3.Ex2.m2.3.3.2.2.2.4.2">ℒ</ci><apply id="S3.Ex2.m2.3.3.2.2.2.4.3.cmml" xref="S3.Ex2.m2.3.3.2.2.2.4.3"><times id="S3.Ex2.m2.3.3.2.2.2.4.3.1.cmml" xref="S3.Ex2.m2.3.3.2.2.2.4.3.1"></times><ci id="S3.Ex2.m2.3.3.2.2.2.4.3.2.cmml" xref="S3.Ex2.m2.3.3.2.2.2.4.3.2">𝑜</ci><ci id="S3.Ex2.m2.3.3.2.2.2.4.3.3.cmml" xref="S3.Ex2.m2.3.3.2.2.2.4.3.3">𝑢</ci><ci id="S3.Ex2.m2.3.3.2.2.2.4.3.4.cmml" xref="S3.Ex2.m2.3.3.2.2.2.4.3.4">𝑡</ci><ci id="S3.Ex2.m2.3.3.2.2.2.4.3.5.cmml" xref="S3.Ex2.m2.3.3.2.2.2.4.3.5">𝑒</ci><ci id="S3.Ex2.m2.3.3.2.2.2.4.3.6.cmml" xref="S3.Ex2.m2.3.3.2.2.2.4.3.6">𝑟</ci></apply></apply><interval closure="open" id="S3.Ex2.m2.3.3.2.2.2.2.3.cmml" xref="S3.Ex2.m2.3.3.2.2.2.2.2"><apply id="S3.Ex2.m2.2.2.1.1.1.1.1.1.cmml" xref="S3.Ex2.m2.2.2.1.1.1.1.1.1"><csymbol cd="ambiguous" id="S3.Ex2.m2.2.2.1.1.1.1.1.1.1.cmml" xref="S3.Ex2.m2.2.2.1.1.1.1.1.1">superscript</csymbol><apply id="S3.Ex2.m2.2.2.1.1.1.1.1.1.2.cmml" xref="S3.Ex2.m2.2.2.1.1.1.1.1.1"><csymbol cd="ambiguous" id="S3.Ex2.m2.2.2.1.1.1.1.1.1.2.1.cmml" xref="S3.Ex2.m2.2.2.1.1.1.1.1.1">subscript</csymbol><ci id="S3.Ex2.m2.2.2.1.1.1.1.1.1.2.2.cmml" xref="S3.Ex2.m2.2.2.1.1.1.1.1.1.2.2">𝜃</ci><ci id="S3.Ex2.m2.2.2.1.1.1.1.1.1.2.3.cmml" xref="S3.Ex2.m2.2.2.1.1.1.1.1.1.2.3">𝑖</ci></apply><times id="S3.Ex2.m2.2.2.1.1.1.1.1.1.3.cmml" xref="S3.Ex2.m2.2.2.1.1.1.1.1.1.3"></times></apply><apply id="S3.Ex2.m2.3.3.2.2.2.2.2.2.cmml" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2"><csymbol cd="ambiguous" id="S3.Ex2.m2.3.3.2.2.2.2.2.2.1.cmml" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2">superscript</csymbol><apply id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.cmml" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2"><csymbol cd="ambiguous" id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.1.cmml" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2">subscript</csymbol><ci id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.2.cmml" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.2">𝒟</ci><apply id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.cmml" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3"><times id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.1.cmml" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.1"></times><ci id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.2.cmml" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.2">𝑠</ci><ci id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.3.cmml" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.3">𝑜</ci><ci id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.4.cmml" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.4">𝑢</ci><ci id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.5.cmml" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.5">𝑟</ci><ci id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.6.cmml" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.6">𝑐</ci><ci id="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.7.cmml" xref="S3.Ex2.m2.3.3.2.2.2.2.2.2.2.3.7">𝑒</ci></apply></apply><apply id="S3.Ex2.m2.1.1.1.cmml" xref="S3.Ex2.m2.1.1.1"><minus id="S3.Ex2.m2.1.1.1.2.cmml" xref="S3.Ex2.m2.1.1.1.2"></minus><apply id="S3.Ex2.m2.1.1.1.3.cmml" xref="S3.Ex2.m2.1.1.1.3"><times id="S3.Ex2.m2.1.1.1.3.1.cmml" xref="S3.Ex2.m2.1.1.1.3.1"></times><ci id="S3.Ex2.m2.1.1.1.3.2.cmml" xref="S3.Ex2.m2.1.1.1.3.2">𝑚</ci><ci id="S3.Ex2.m2.1.1.1.3.3.cmml" xref="S3.Ex2.m2.1.1.1.3.3">𝑒</ci><ci id="S3.Ex2.m2.1.1.1.3.4.cmml" xref="S3.Ex2.m2.1.1.1.3.4">𝑡</ci><ci id="S3.Ex2.m2.1.1.1.3.5.cmml" xref="S3.Ex2.m2.1.1.1.3.5">𝑎</ci></apply><apply id="S3.Ex2.m2.1.1.1.4.cmml" xref="S3.Ex2.m2.1.1.1.4"><times id="S3.Ex2.m2.1.1.1.4.1.cmml" xref="S3.Ex2.m2.1.1.1.4.1"></times><ci id="S3.Ex2.m2.1.1.1.4.2.cmml" xref="S3.Ex2.m2.1.1.1.4.2">𝑡</ci><ci id="S3.Ex2.m2.1.1.1.4.3.cmml" xref="S3.Ex2.m2.1.1.1.4.3">𝑒</ci><ci id="S3.Ex2.m2.1.1.1.4.4.cmml" xref="S3.Ex2.m2.1.1.1.4.4">𝑠</ci><ci id="S3.Ex2.m2.1.1.1.4.5.cmml" xref="S3.Ex2.m2.1.1.1.4.5">𝑡</ci><ci id="S3.Ex2.m2.1.1.1.1.cmml" xref="S3.Ex2.m2.1.1.1.1">𝑖</ci></apply></apply></apply></interval></apply></apply></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.Ex2.m2.3c">\displaystyle=\operatorname*{argmin}_{\theta}\sum_{i}^{T}\mathcal{L}^{outer}(% \theta_{i}^{*},\mathcal{D}_{source}^{meta-test(i)})</annotation><annotation encoding="application/x-llamapun" id="S3.Ex2.m2.3d">= roman_argmin start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT caligraphic_L start_POSTSUPERSCRIPT italic_o italic_u italic_t italic_e italic_r end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , caligraphic_D start_POSTSUBSCRIPT italic_s italic_o italic_u italic_r italic_c italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_e italic_t italic_a - italic_t italic_e italic_s italic_t ( italic_i ) end_POSTSUPERSCRIPT )</annotation></semantics></math></td> <td class="ltx_eqn_cell ltx_eqn_center_padright"></td> </tr></tbody> <tbody id="S3.E12"><tr class="ltx_equation ltx_eqn_row ltx_align_baseline"> <td class="ltx_eqn_cell ltx_eqn_center_padleft"></td> <td class="ltx_td ltx_eqn_cell"></td> <td class="ltx_td ltx_align_left ltx_eqn_cell"><math alttext="\displaystyle\;\;\;\;+\mathcal{L}^{inner}(\theta_{i},\mathcal{D}_{source}^{% meta-train(i)})" class="ltx_Math" display="inline" id="S3.E12.m1.3"><semantics id="S3.E12.m1.3a"><mrow id="S3.E12.m1.3.3" xref="S3.E12.m1.3.3.cmml"><mo id="S3.E12.m1.3.3a" mathsize="90%" xref="S3.E12.m1.3.3.cmml">+</mo><mrow id="S3.E12.m1.3.3.2" xref="S3.E12.m1.3.3.2.cmml"><msup id="S3.E12.m1.3.3.2.4" xref="S3.E12.m1.3.3.2.4.cmml"><mi class="ltx_font_mathcaligraphic" id="S3.E12.m1.3.3.2.4.2" mathsize="90%" xref="S3.E12.m1.3.3.2.4.2.cmml">ℒ</mi><mrow id="S3.E12.m1.3.3.2.4.3" xref="S3.E12.m1.3.3.2.4.3.cmml"><mi id="S3.E12.m1.3.3.2.4.3.2" mathsize="90%" xref="S3.E12.m1.3.3.2.4.3.2.cmml">i</mi><mo id="S3.E12.m1.3.3.2.4.3.1" xref="S3.E12.m1.3.3.2.4.3.1.cmml">⁢</mo><mi id="S3.E12.m1.3.3.2.4.3.3" mathsize="90%" xref="S3.E12.m1.3.3.2.4.3.3.cmml">n</mi><mo id="S3.E12.m1.3.3.2.4.3.1a" xref="S3.E12.m1.3.3.2.4.3.1.cmml">⁢</mo><mi id="S3.E12.m1.3.3.2.4.3.4" mathsize="90%" xref="S3.E12.m1.3.3.2.4.3.4.cmml">n</mi><mo id="S3.E12.m1.3.3.2.4.3.1b" xref="S3.E12.m1.3.3.2.4.3.1.cmml">⁢</mo><mi id="S3.E12.m1.3.3.2.4.3.5" mathsize="90%" xref="S3.E12.m1.3.3.2.4.3.5.cmml">e</mi><mo id="S3.E12.m1.3.3.2.4.3.1c" xref="S3.E12.m1.3.3.2.4.3.1.cmml">⁢</mo><mi id="S3.E12.m1.3.3.2.4.3.6" mathsize="90%" xref="S3.E12.m1.3.3.2.4.3.6.cmml">r</mi></mrow></msup><mo id="S3.E12.m1.3.3.2.3" xref="S3.E12.m1.3.3.2.3.cmml">⁢</mo><mrow id="S3.E12.m1.3.3.2.2.2" xref="S3.E12.m1.3.3.2.2.3.cmml"><mo id="S3.E12.m1.3.3.2.2.2.3" maxsize="90%" minsize="90%" xref="S3.E12.m1.3.3.2.2.3.cmml">(</mo><msub id="S3.E12.m1.2.2.1.1.1.1" xref="S3.E12.m1.2.2.1.1.1.1.cmml"><mi id="S3.E12.m1.2.2.1.1.1.1.2" mathsize="90%" xref="S3.E12.m1.2.2.1.1.1.1.2.cmml">θ</mi><mi id="S3.E12.m1.2.2.1.1.1.1.3" mathsize="90%" xref="S3.E12.m1.2.2.1.1.1.1.3.cmml">i</mi></msub><mo id="S3.E12.m1.3.3.2.2.2.4" mathsize="90%" xref="S3.E12.m1.3.3.2.2.3.cmml">,</mo><msubsup id="S3.E12.m1.3.3.2.2.2.2" xref="S3.E12.m1.3.3.2.2.2.2.cmml"><mi class="ltx_font_mathcaligraphic" id="S3.E12.m1.3.3.2.2.2.2.2.2" mathsize="90%" xref="S3.E12.m1.3.3.2.2.2.2.2.2.cmml">𝒟</mi><mrow id="S3.E12.m1.3.3.2.2.2.2.2.3" xref="S3.E12.m1.3.3.2.2.2.2.2.3.cmml"><mi id="S3.E12.m1.3.3.2.2.2.2.2.3.2" mathsize="90%" xref="S3.E12.m1.3.3.2.2.2.2.2.3.2.cmml">s</mi><mo id="S3.E12.m1.3.3.2.2.2.2.2.3.1" xref="S3.E12.m1.3.3.2.2.2.2.2.3.1.cmml">⁢</mo><mi id="S3.E12.m1.3.3.2.2.2.2.2.3.3" mathsize="90%" xref="S3.E12.m1.3.3.2.2.2.2.2.3.3.cmml">o</mi><mo id="S3.E12.m1.3.3.2.2.2.2.2.3.1a" xref="S3.E12.m1.3.3.2.2.2.2.2.3.1.cmml">⁢</mo><mi id="S3.E12.m1.3.3.2.2.2.2.2.3.4" mathsize="90%" xref="S3.E12.m1.3.3.2.2.2.2.2.3.4.cmml">u</mi><mo id="S3.E12.m1.3.3.2.2.2.2.2.3.1b" xref="S3.E12.m1.3.3.2.2.2.2.2.3.1.cmml">⁢</mo><mi id="S3.E12.m1.3.3.2.2.2.2.2.3.5" mathsize="90%" xref="S3.E12.m1.3.3.2.2.2.2.2.3.5.cmml">r</mi><mo id="S3.E12.m1.3.3.2.2.2.2.2.3.1c" xref="S3.E12.m1.3.3.2.2.2.2.2.3.1.cmml">⁢</mo><mi id="S3.E12.m1.3.3.2.2.2.2.2.3.6" mathsize="90%" xref="S3.E12.m1.3.3.2.2.2.2.2.3.6.cmml">c</mi><mo id="S3.E12.m1.3.3.2.2.2.2.2.3.1d" xref="S3.E12.m1.3.3.2.2.2.2.2.3.1.cmml">⁢</mo><mi id="S3.E12.m1.3.3.2.2.2.2.2.3.7" mathsize="90%" xref="S3.E12.m1.3.3.2.2.2.2.2.3.7.cmml">e</mi></mrow><mrow id="S3.E12.m1.1.1.1" xref="S3.E12.m1.1.1.1.cmml"><mrow id="S3.E12.m1.1.1.1.3" xref="S3.E12.m1.1.1.1.3.cmml"><mi id="S3.E12.m1.1.1.1.3.2" mathsize="90%" xref="S3.E12.m1.1.1.1.3.2.cmml">m</mi><mo id="S3.E12.m1.1.1.1.3.1" xref="S3.E12.m1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.E12.m1.1.1.1.3.3" mathsize="90%" xref="S3.E12.m1.1.1.1.3.3.cmml">e</mi><mo id="S3.E12.m1.1.1.1.3.1a" xref="S3.E12.m1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.E12.m1.1.1.1.3.4" mathsize="90%" xref="S3.E12.m1.1.1.1.3.4.cmml">t</mi><mo id="S3.E12.m1.1.1.1.3.1b" xref="S3.E12.m1.1.1.1.3.1.cmml">⁢</mo><mi id="S3.E12.m1.1.1.1.3.5" mathsize="90%" xref="S3.E12.m1.1.1.1.3.5.cmml">a</mi></mrow><mo id="S3.E12.m1.1.1.1.2" mathsize="90%" xref="S3.E12.m1.1.1.1.2.cmml">−</mo><mrow id="S3.E12.m1.1.1.1.4" xref="S3.E12.m1.1.1.1.4.cmml"><mi id="S3.E12.m1.1.1.1.4.2" mathsize="90%" xref="S3.E12.m1.1.1.1.4.2.cmml">t</mi><mo id="S3.E12.m1.1.1.1.4.1" xref="S3.E12.m1.1.1.1.4.1.cmml">⁢</mo><mi id="S3.E12.m1.1.1.1.4.3" mathsize="90%" xref="S3.E12.m1.1.1.1.4.3.cmml">r</mi><mo id="S3.E12.m1.1.1.1.4.1a" xref="S3.E12.m1.1.1.1.4.1.cmml">⁢</mo><mi id="S3.E12.m1.1.1.1.4.4" mathsize="90%" xref="S3.E12.m1.1.1.1.4.4.cmml">a</mi><mo id="S3.E12.m1.1.1.1.4.1b" xref="S3.E12.m1.1.1.1.4.1.cmml">⁢</mo><mi id="S3.E12.m1.1.1.1.4.5" mathsize="90%" xref="S3.E12.m1.1.1.1.4.5.cmml">i</mi><mo id="S3.E12.m1.1.1.1.4.1c" xref="S3.E12.m1.1.1.1.4.1.cmml">⁢</mo><mi id="S3.E12.m1.1.1.1.4.6" mathsize="90%" xref="S3.E12.m1.1.1.1.4.6.cmml">n</mi><mo id="S3.E12.m1.1.1.1.4.1d" xref="S3.E12.m1.1.1.1.4.1.cmml">⁢</mo><mrow id="S3.E12.m1.1.1.1.4.7.2" xref="S3.E12.m1.1.1.1.4.cmml"><mo id="S3.E12.m1.1.1.1.4.7.2.1" maxsize="90%" minsize="90%" xref="S3.E12.m1.1.1.1.4.cmml">(</mo><mi id="S3.E12.m1.1.1.1.1" mathsize="90%" xref="S3.E12.m1.1.1.1.1.cmml">i</mi><mo id="S3.E12.m1.1.1.1.4.7.2.2" maxsize="90%" minsize="90%" xref="S3.E12.m1.1.1.1.4.cmml">)</mo></mrow></mrow></mrow></msubsup><mo id="S3.E12.m1.3.3.2.2.2.5" maxsize="90%" minsize="90%" xref="S3.E12.m1.3.3.2.2.3.cmml">)</mo></mrow></mrow></mrow><annotation-xml encoding="MathML-Content" id="S3.E12.m1.3b"><apply id="S3.E12.m1.3.3.cmml" xref="S3.E12.m1.3.3"><plus id="S3.E12.m1.3.3.3.cmml" xref="S3.E12.m1.3.3"></plus><apply id="S3.E12.m1.3.3.2.cmml" xref="S3.E12.m1.3.3.2"><times id="S3.E12.m1.3.3.2.3.cmml" xref="S3.E12.m1.3.3.2.3"></times><apply id="S3.E12.m1.3.3.2.4.cmml" xref="S3.E12.m1.3.3.2.4"><csymbol cd="ambiguous" id="S3.E12.m1.3.3.2.4.1.cmml" xref="S3.E12.m1.3.3.2.4">superscript</csymbol><ci id="S3.E12.m1.3.3.2.4.2.cmml" xref="S3.E12.m1.3.3.2.4.2">ℒ</ci><apply id="S3.E12.m1.3.3.2.4.3.cmml" xref="S3.E12.m1.3.3.2.4.3"><times id="S3.E12.m1.3.3.2.4.3.1.cmml" xref="S3.E12.m1.3.3.2.4.3.1"></times><ci id="S3.E12.m1.3.3.2.4.3.2.cmml" xref="S3.E12.m1.3.3.2.4.3.2">𝑖</ci><ci id="S3.E12.m1.3.3.2.4.3.3.cmml" xref="S3.E12.m1.3.3.2.4.3.3">𝑛</ci><ci id="S3.E12.m1.3.3.2.4.3.4.cmml" xref="S3.E12.m1.3.3.2.4.3.4">𝑛</ci><ci id="S3.E12.m1.3.3.2.4.3.5.cmml" xref="S3.E12.m1.3.3.2.4.3.5">𝑒</ci><ci id="S3.E12.m1.3.3.2.4.3.6.cmml" xref="S3.E12.m1.3.3.2.4.3.6">𝑟</ci></apply></apply><interval closure="open" id="S3.E12.m1.3.3.2.2.3.cmml" xref="S3.E12.m1.3.3.2.2.2"><apply id="S3.E12.m1.2.2.1.1.1.1.cmml" xref="S3.E12.m1.2.2.1.1.1.1"><csymbol cd="ambiguous" id="S3.E12.m1.2.2.1.1.1.1.1.cmml" xref="S3.E12.m1.2.2.1.1.1.1">subscript</csymbol><ci id="S3.E12.m1.2.2.1.1.1.1.2.cmml" xref="S3.E12.m1.2.2.1.1.1.1.2">𝜃</ci><ci id="S3.E12.m1.2.2.1.1.1.1.3.cmml" xref="S3.E12.m1.2.2.1.1.1.1.3">𝑖</ci></apply><apply id="S3.E12.m1.3.3.2.2.2.2.cmml" xref="S3.E12.m1.3.3.2.2.2.2"><csymbol cd="ambiguous" id="S3.E12.m1.3.3.2.2.2.2.1.cmml" xref="S3.E12.m1.3.3.2.2.2.2">superscript</csymbol><apply id="S3.E12.m1.3.3.2.2.2.2.2.cmml" xref="S3.E12.m1.3.3.2.2.2.2"><csymbol cd="ambiguous" id="S3.E12.m1.3.3.2.2.2.2.2.1.cmml" xref="S3.E12.m1.3.3.2.2.2.2">subscript</csymbol><ci id="S3.E12.m1.3.3.2.2.2.2.2.2.cmml" xref="S3.E12.m1.3.3.2.2.2.2.2.2">𝒟</ci><apply id="S3.E12.m1.3.3.2.2.2.2.2.3.cmml" xref="S3.E12.m1.3.3.2.2.2.2.2.3"><times id="S3.E12.m1.3.3.2.2.2.2.2.3.1.cmml" xref="S3.E12.m1.3.3.2.2.2.2.2.3.1"></times><ci id="S3.E12.m1.3.3.2.2.2.2.2.3.2.cmml" xref="S3.E12.m1.3.3.2.2.2.2.2.3.2">𝑠</ci><ci id="S3.E12.m1.3.3.2.2.2.2.2.3.3.cmml" xref="S3.E12.m1.3.3.2.2.2.2.2.3.3">𝑜</ci><ci id="S3.E12.m1.3.3.2.2.2.2.2.3.4.cmml" xref="S3.E12.m1.3.3.2.2.2.2.2.3.4">𝑢</ci><ci id="S3.E12.m1.3.3.2.2.2.2.2.3.5.cmml" xref="S3.E12.m1.3.3.2.2.2.2.2.3.5">𝑟</ci><ci id="S3.E12.m1.3.3.2.2.2.2.2.3.6.cmml" xref="S3.E12.m1.3.3.2.2.2.2.2.3.6">𝑐</ci><ci id="S3.E12.m1.3.3.2.2.2.2.2.3.7.cmml" xref="S3.E12.m1.3.3.2.2.2.2.2.3.7">𝑒</ci></apply></apply><apply id="S3.E12.m1.1.1.1.cmml" xref="S3.E12.m1.1.1.1"><minus id="S3.E12.m1.1.1.1.2.cmml" xref="S3.E12.m1.1.1.1.2"></minus><apply id="S3.E12.m1.1.1.1.3.cmml" xref="S3.E12.m1.1.1.1.3"><times id="S3.E12.m1.1.1.1.3.1.cmml" xref="S3.E12.m1.1.1.1.3.1"></times><ci id="S3.E12.m1.1.1.1.3.2.cmml" xref="S3.E12.m1.1.1.1.3.2">𝑚</ci><ci id="S3.E12.m1.1.1.1.3.3.cmml" xref="S3.E12.m1.1.1.1.3.3">𝑒</ci><ci id="S3.E12.m1.1.1.1.3.4.cmml" xref="S3.E12.m1.1.1.1.3.4">𝑡</ci><ci id="S3.E12.m1.1.1.1.3.5.cmml" xref="S3.E12.m1.1.1.1.3.5">𝑎</ci></apply><apply id="S3.E12.m1.1.1.1.4.cmml" xref="S3.E12.m1.1.1.1.4"><times id="S3.E12.m1.1.1.1.4.1.cmml" xref="S3.E12.m1.1.1.1.4.1"></times><ci id="S3.E12.m1.1.1.1.4.2.cmml" xref="S3.E12.m1.1.1.1.4.2">𝑡</ci><ci id="S3.E12.m1.1.1.1.4.3.cmml" xref="S3.E12.m1.1.1.1.4.3">𝑟</ci><ci id="S3.E12.m1.1.1.1.4.4.cmml" xref="S3.E12.m1.1.1.1.4.4">𝑎</ci><ci id="S3.E12.m1.1.1.1.4.5.cmml" xref="S3.E12.m1.1.1.1.4.5">𝑖</ci><ci id="S3.E12.m1.1.1.1.4.6.cmml" xref="S3.E12.m1.1.1.1.4.6">𝑛</ci><ci id="S3.E12.m1.1.1.1.1.cmml" xref="S3.E12.m1.1.1.1.1">𝑖</ci></apply></apply></apply></interval></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E12.m1.3c">\displaystyle\;\;\;\;+\mathcal{L}^{inner}(\theta_{i},\mathcal{D}_{source}^{% meta-train(i)})</annotation><annotation encoding="application/x-llamapun" id="S3.E12.m1.3d">+ caligraphic_L start_POSTSUPERSCRIPT italic_i italic_n italic_n italic_e italic_r end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , caligraphic_D start_POSTSUBSCRIPT italic_s italic_o italic_u italic_r italic_c italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_e italic_t italic_a - italic_t italic_r italic_a italic_i italic_n ( italic_i ) end_POSTSUPERSCRIPT )</annotation></semantics></math></td> <td class="ltx_eqn_cell ltx_eqn_center_padright"></td> <td class="ltx_eqn_cell ltx_eqn_eqno ltx_align_middle ltx_align_right" rowspan="1"><span class="ltx_tag ltx_tag_equation ltx_align_right">(12)</span></td> </tr></tbody> <tbody id="S3.E13"><tr class="ltx_equation ltx_eqn_row ltx_align_baseline"> <td class="ltx_eqn_cell ltx_eqn_center_padleft"></td> <td class="ltx_td ltx_align_right ltx_eqn_cell"><math alttext="\displaystyle\text{s}.\text{t}.\quad\theta_{i}^{*}" class="ltx_Math" display="inline" id="S3.E13.m1.3"><semantics id="S3.E13.m1.3a"><mrow id="S3.E13.m1.3.3.1" xref="S3.E13.m1.3.3.2.cmml"><mtext id="S3.E13.m1.1.1" mathsize="90%" xref="S3.E13.m1.1.1a.cmml">s</mtext><mo id="S3.E13.m1.3.3.1.2" lspace="0em" mathsize="90%" rspace="0.167em" xref="S3.E13.m1.3.3.2a.cmml">.</mo><mtext id="S3.E13.m1.2.2" mathsize="90%" xref="S3.E13.m1.2.2a.cmml">t</mtext><mo id="S3.E13.m1.3.3.1.3" lspace="0em" mathsize="90%" rspace="1.067em" xref="S3.E13.m1.3.3.2a.cmml">.</mo><msubsup id="S3.E13.m1.3.3.1.1" xref="S3.E13.m1.3.3.1.1.cmml"><mi id="S3.E13.m1.3.3.1.1.2.2" mathsize="90%" xref="S3.E13.m1.3.3.1.1.2.2.cmml">θ</mi><mi id="S3.E13.m1.3.3.1.1.2.3" mathsize="90%" xref="S3.E13.m1.3.3.1.1.2.3.cmml">i</mi><mo id="S3.E13.m1.3.3.1.1.3" mathsize="90%" xref="S3.E13.m1.3.3.1.1.3.cmml">∗</mo></msubsup></mrow><annotation-xml encoding="MathML-Content" id="S3.E13.m1.3b"><apply id="S3.E13.m1.3.3.2.cmml" xref="S3.E13.m1.3.3.1"><csymbol cd="ambiguous" id="S3.E13.m1.3.3.2a.cmml" xref="S3.E13.m1.3.3.1.2">formulae-sequence</csymbol><ci id="S3.E13.m1.1.1a.cmml" xref="S3.E13.m1.1.1"><mtext id="S3.E13.m1.1.1.cmml" mathsize="90%" xref="S3.E13.m1.1.1">s</mtext></ci><ci id="S3.E13.m1.2.2a.cmml" xref="S3.E13.m1.2.2"><mtext id="S3.E13.m1.2.2.cmml" mathsize="90%" xref="S3.E13.m1.2.2">t</mtext></ci><apply id="S3.E13.m1.3.3.1.1.cmml" xref="S3.E13.m1.3.3.1.1"><csymbol cd="ambiguous" id="S3.E13.m1.3.3.1.1.1.cmml" xref="S3.E13.m1.3.3.1.1">superscript</csymbol><apply id="S3.E13.m1.3.3.1.1.2.cmml" xref="S3.E13.m1.3.3.1.1"><csymbol cd="ambiguous" id="S3.E13.m1.3.3.1.1.2.1.cmml" xref="S3.E13.m1.3.3.1.1">subscript</csymbol><ci id="S3.E13.m1.3.3.1.1.2.2.cmml" xref="S3.E13.m1.3.3.1.1.2.2">𝜃</ci><ci id="S3.E13.m1.3.3.1.1.2.3.cmml" xref="S3.E13.m1.3.3.1.1.2.3">𝑖</ci></apply><times id="S3.E13.m1.3.3.1.1.3.cmml" xref="S3.E13.m1.3.3.1.1.3"></times></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E13.m1.3c">\displaystyle\text{s}.\text{t}.\quad\theta_{i}^{*}</annotation><annotation encoding="application/x-llamapun" id="S3.E13.m1.3d">s . t . italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT</annotation></semantics></math></td> <td class="ltx_td ltx_align_left ltx_eqn_cell"><math alttext="\displaystyle=\operatorname*{argmin}_{\theta}\mathcal{L}^{inner}(\theta_{i},% \mathcal{D}_{source}^{meta-train(i)})," class="ltx_Math" display="inline" id="S3.E13.m2.2"><semantics id="S3.E13.m2.2a"><mrow id="S3.E13.m2.2.2.1" xref="S3.E13.m2.2.2.1.1.cmml"><mrow id="S3.E13.m2.2.2.1.1" xref="S3.E13.m2.2.2.1.1.cmml"><mi id="S3.E13.m2.2.2.1.1.4" xref="S3.E13.m2.2.2.1.1.4.cmml"></mi><mo id="S3.E13.m2.2.2.1.1.3" mathsize="90%" rspace="0.1389em" xref="S3.E13.m2.2.2.1.1.3.cmml">=</mo><mrow id="S3.E13.m2.2.2.1.1.2" xref="S3.E13.m2.2.2.1.1.2.cmml"><mrow id="S3.E13.m2.2.2.1.1.2.4" xref="S3.E13.m2.2.2.1.1.2.4.cmml"><munder id="S3.E13.m2.2.2.1.1.2.4.1" xref="S3.E13.m2.2.2.1.1.2.4.1.cmml"><mo id="S3.E13.m2.2.2.1.1.2.4.1.2" lspace="0.1389em" mathsize="90%" rspace="0.167em" xref="S3.E13.m2.2.2.1.1.2.4.1.2.cmml">argmin</mo><mi id="S3.E13.m2.2.2.1.1.2.4.1.3" mathsize="90%" xref="S3.E13.m2.2.2.1.1.2.4.1.3.cmml">θ</mi></munder><msup id="S3.E13.m2.2.2.1.1.2.4.2" xref="S3.E13.m2.2.2.1.1.2.4.2.cmml"><mi class="ltx_font_mathcaligraphic" id="S3.E13.m2.2.2.1.1.2.4.2.2" mathsize="90%" xref="S3.E13.m2.2.2.1.1.2.4.2.2.cmml">ℒ</mi><mrow id="S3.E13.m2.2.2.1.1.2.4.2.3" xref="S3.E13.m2.2.2.1.1.2.4.2.3.cmml"><mi id="S3.E13.m2.2.2.1.1.2.4.2.3.2" mathsize="90%" xref="S3.E13.m2.2.2.1.1.2.4.2.3.2.cmml">i</mi><mo id="S3.E13.m2.2.2.1.1.2.4.2.3.1" xref="S3.E13.m2.2.2.1.1.2.4.2.3.1.cmml">⁢</mo><mi id="S3.E13.m2.2.2.1.1.2.4.2.3.3" mathsize="90%" xref="S3.E13.m2.2.2.1.1.2.4.2.3.3.cmml">n</mi><mo id="S3.E13.m2.2.2.1.1.2.4.2.3.1a" xref="S3.E13.m2.2.2.1.1.2.4.2.3.1.cmml">⁢</mo><mi id="S3.E13.m2.2.2.1.1.2.4.2.3.4" mathsize="90%" xref="S3.E13.m2.2.2.1.1.2.4.2.3.4.cmml">n</mi><mo id="S3.E13.m2.2.2.1.1.2.4.2.3.1b" xref="S3.E13.m2.2.2.1.1.2.4.2.3.1.cmml">⁢</mo><mi id="S3.E13.m2.2.2.1.1.2.4.2.3.5" mathsize="90%" xref="S3.E13.m2.2.2.1.1.2.4.2.3.5.cmml">e</mi><mo id="S3.E13.m2.2.2.1.1.2.4.2.3.1c" xref="S3.E13.m2.2.2.1.1.2.4.2.3.1.cmml">⁢</mo><mi id="S3.E13.m2.2.2.1.1.2.4.2.3.6" mathsize="90%" xref="S3.E13.m2.2.2.1.1.2.4.2.3.6.cmml">r</mi></mrow></msup></mrow><mo id="S3.E13.m2.2.2.1.1.2.3" xref="S3.E13.m2.2.2.1.1.2.3.cmml">⁢</mo><mrow id="S3.E13.m2.2.2.1.1.2.2.2" xref="S3.E13.m2.2.2.1.1.2.2.3.cmml"><mo id="S3.E13.m2.2.2.1.1.2.2.2.3" maxsize="90%" minsize="90%" xref="S3.E13.m2.2.2.1.1.2.2.3.cmml">(</mo><msub id="S3.E13.m2.2.2.1.1.1.1.1.1" xref="S3.E13.m2.2.2.1.1.1.1.1.1.cmml"><mi id="S3.E13.m2.2.2.1.1.1.1.1.1.2" mathsize="90%" xref="S3.E13.m2.2.2.1.1.1.1.1.1.2.cmml">θ</mi><mi id="S3.E13.m2.2.2.1.1.1.1.1.1.3" mathsize="90%" xref="S3.E13.m2.2.2.1.1.1.1.1.1.3.cmml">i</mi></msub><mo id="S3.E13.m2.2.2.1.1.2.2.2.4" mathsize="90%" xref="S3.E13.m2.2.2.1.1.2.2.3.cmml">,</mo><msubsup id="S3.E13.m2.2.2.1.1.2.2.2.2" xref="S3.E13.m2.2.2.1.1.2.2.2.2.cmml"><mi class="ltx_font_mathcaligraphic" id="S3.E13.m2.2.2.1.1.2.2.2.2.2.2" mathsize="90%" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.2.cmml">𝒟</mi><mrow id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.cmml"><mi id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.2" mathsize="90%" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.2.cmml">s</mi><mo id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.1" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.1.cmml">⁢</mo><mi id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.3" mathsize="90%" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.3.cmml">o</mi><mo id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.1a" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.1.cmml">⁢</mo><mi id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.4" mathsize="90%" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.4.cmml">u</mi><mo id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.1b" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.1.cmml">⁢</mo><mi id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.5" mathsize="90%" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.5.cmml">r</mi><mo id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.1c" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.1.cmml">⁢</mo><mi id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.6" mathsize="90%" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.6.cmml">c</mi><mo id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.1d" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.1.cmml">⁢</mo><mi id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.7" mathsize="90%" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.7.cmml">e</mi></mrow><mrow id="S3.E13.m2.1.1.1" xref="S3.E13.m2.1.1.1.cmml"><mrow id="S3.E13.m2.1.1.1.3" xref="S3.E13.m2.1.1.1.3.cmml"><mi id="S3.E13.m2.1.1.1.3.2" mathsize="90%" xref="S3.E13.m2.1.1.1.3.2.cmml">m</mi><mo id="S3.E13.m2.1.1.1.3.1" xref="S3.E13.m2.1.1.1.3.1.cmml">⁢</mo><mi id="S3.E13.m2.1.1.1.3.3" mathsize="90%" xref="S3.E13.m2.1.1.1.3.3.cmml">e</mi><mo id="S3.E13.m2.1.1.1.3.1a" xref="S3.E13.m2.1.1.1.3.1.cmml">⁢</mo><mi id="S3.E13.m2.1.1.1.3.4" mathsize="90%" xref="S3.E13.m2.1.1.1.3.4.cmml">t</mi><mo id="S3.E13.m2.1.1.1.3.1b" xref="S3.E13.m2.1.1.1.3.1.cmml">⁢</mo><mi id="S3.E13.m2.1.1.1.3.5" mathsize="90%" xref="S3.E13.m2.1.1.1.3.5.cmml">a</mi></mrow><mo id="S3.E13.m2.1.1.1.2" mathsize="90%" xref="S3.E13.m2.1.1.1.2.cmml">−</mo><mrow id="S3.E13.m2.1.1.1.4" xref="S3.E13.m2.1.1.1.4.cmml"><mi id="S3.E13.m2.1.1.1.4.2" mathsize="90%" xref="S3.E13.m2.1.1.1.4.2.cmml">t</mi><mo id="S3.E13.m2.1.1.1.4.1" xref="S3.E13.m2.1.1.1.4.1.cmml">⁢</mo><mi id="S3.E13.m2.1.1.1.4.3" mathsize="90%" xref="S3.E13.m2.1.1.1.4.3.cmml">r</mi><mo id="S3.E13.m2.1.1.1.4.1a" xref="S3.E13.m2.1.1.1.4.1.cmml">⁢</mo><mi id="S3.E13.m2.1.1.1.4.4" mathsize="90%" xref="S3.E13.m2.1.1.1.4.4.cmml">a</mi><mo id="S3.E13.m2.1.1.1.4.1b" xref="S3.E13.m2.1.1.1.4.1.cmml">⁢</mo><mi id="S3.E13.m2.1.1.1.4.5" mathsize="90%" xref="S3.E13.m2.1.1.1.4.5.cmml">i</mi><mo id="S3.E13.m2.1.1.1.4.1c" xref="S3.E13.m2.1.1.1.4.1.cmml">⁢</mo><mi id="S3.E13.m2.1.1.1.4.6" mathsize="90%" xref="S3.E13.m2.1.1.1.4.6.cmml">n</mi><mo id="S3.E13.m2.1.1.1.4.1d" xref="S3.E13.m2.1.1.1.4.1.cmml">⁢</mo><mrow id="S3.E13.m2.1.1.1.4.7.2" xref="S3.E13.m2.1.1.1.4.cmml"><mo id="S3.E13.m2.1.1.1.4.7.2.1" maxsize="90%" minsize="90%" xref="S3.E13.m2.1.1.1.4.cmml">(</mo><mi id="S3.E13.m2.1.1.1.1" mathsize="90%" xref="S3.E13.m2.1.1.1.1.cmml">i</mi><mo id="S3.E13.m2.1.1.1.4.7.2.2" maxsize="90%" minsize="90%" xref="S3.E13.m2.1.1.1.4.cmml">)</mo></mrow></mrow></mrow></msubsup><mo id="S3.E13.m2.2.2.1.1.2.2.2.5" maxsize="90%" minsize="90%" xref="S3.E13.m2.2.2.1.1.2.2.3.cmml">)</mo></mrow></mrow></mrow><mo id="S3.E13.m2.2.2.1.2" mathsize="90%" xref="S3.E13.m2.2.2.1.1.cmml">,</mo></mrow><annotation-xml encoding="MathML-Content" id="S3.E13.m2.2b"><apply id="S3.E13.m2.2.2.1.1.cmml" xref="S3.E13.m2.2.2.1"><eq id="S3.E13.m2.2.2.1.1.3.cmml" xref="S3.E13.m2.2.2.1.1.3"></eq><csymbol cd="latexml" id="S3.E13.m2.2.2.1.1.4.cmml" xref="S3.E13.m2.2.2.1.1.4">absent</csymbol><apply id="S3.E13.m2.2.2.1.1.2.cmml" xref="S3.E13.m2.2.2.1.1.2"><times id="S3.E13.m2.2.2.1.1.2.3.cmml" xref="S3.E13.m2.2.2.1.1.2.3"></times><apply id="S3.E13.m2.2.2.1.1.2.4.cmml" xref="S3.E13.m2.2.2.1.1.2.4"><apply id="S3.E13.m2.2.2.1.1.2.4.1.cmml" xref="S3.E13.m2.2.2.1.1.2.4.1"><csymbol cd="ambiguous" id="S3.E13.m2.2.2.1.1.2.4.1.1.cmml" xref="S3.E13.m2.2.2.1.1.2.4.1">subscript</csymbol><ci id="S3.E13.m2.2.2.1.1.2.4.1.2.cmml" xref="S3.E13.m2.2.2.1.1.2.4.1.2">argmin</ci><ci id="S3.E13.m2.2.2.1.1.2.4.1.3.cmml" xref="S3.E13.m2.2.2.1.1.2.4.1.3">𝜃</ci></apply><apply id="S3.E13.m2.2.2.1.1.2.4.2.cmml" xref="S3.E13.m2.2.2.1.1.2.4.2"><csymbol cd="ambiguous" id="S3.E13.m2.2.2.1.1.2.4.2.1.cmml" xref="S3.E13.m2.2.2.1.1.2.4.2">superscript</csymbol><ci id="S3.E13.m2.2.2.1.1.2.4.2.2.cmml" xref="S3.E13.m2.2.2.1.1.2.4.2.2">ℒ</ci><apply id="S3.E13.m2.2.2.1.1.2.4.2.3.cmml" xref="S3.E13.m2.2.2.1.1.2.4.2.3"><times id="S3.E13.m2.2.2.1.1.2.4.2.3.1.cmml" xref="S3.E13.m2.2.2.1.1.2.4.2.3.1"></times><ci id="S3.E13.m2.2.2.1.1.2.4.2.3.2.cmml" xref="S3.E13.m2.2.2.1.1.2.4.2.3.2">𝑖</ci><ci id="S3.E13.m2.2.2.1.1.2.4.2.3.3.cmml" xref="S3.E13.m2.2.2.1.1.2.4.2.3.3">𝑛</ci><ci id="S3.E13.m2.2.2.1.1.2.4.2.3.4.cmml" xref="S3.E13.m2.2.2.1.1.2.4.2.3.4">𝑛</ci><ci id="S3.E13.m2.2.2.1.1.2.4.2.3.5.cmml" xref="S3.E13.m2.2.2.1.1.2.4.2.3.5">𝑒</ci><ci id="S3.E13.m2.2.2.1.1.2.4.2.3.6.cmml" xref="S3.E13.m2.2.2.1.1.2.4.2.3.6">𝑟</ci></apply></apply></apply><interval closure="open" id="S3.E13.m2.2.2.1.1.2.2.3.cmml" xref="S3.E13.m2.2.2.1.1.2.2.2"><apply id="S3.E13.m2.2.2.1.1.1.1.1.1.cmml" xref="S3.E13.m2.2.2.1.1.1.1.1.1"><csymbol cd="ambiguous" id="S3.E13.m2.2.2.1.1.1.1.1.1.1.cmml" xref="S3.E13.m2.2.2.1.1.1.1.1.1">subscript</csymbol><ci id="S3.E13.m2.2.2.1.1.1.1.1.1.2.cmml" xref="S3.E13.m2.2.2.1.1.1.1.1.1.2">𝜃</ci><ci id="S3.E13.m2.2.2.1.1.1.1.1.1.3.cmml" xref="S3.E13.m2.2.2.1.1.1.1.1.1.3">𝑖</ci></apply><apply id="S3.E13.m2.2.2.1.1.2.2.2.2.cmml" xref="S3.E13.m2.2.2.1.1.2.2.2.2"><csymbol cd="ambiguous" id="S3.E13.m2.2.2.1.1.2.2.2.2.1.cmml" xref="S3.E13.m2.2.2.1.1.2.2.2.2">superscript</csymbol><apply id="S3.E13.m2.2.2.1.1.2.2.2.2.2.cmml" xref="S3.E13.m2.2.2.1.1.2.2.2.2"><csymbol cd="ambiguous" id="S3.E13.m2.2.2.1.1.2.2.2.2.2.1.cmml" xref="S3.E13.m2.2.2.1.1.2.2.2.2">subscript</csymbol><ci id="S3.E13.m2.2.2.1.1.2.2.2.2.2.2.cmml" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.2">𝒟</ci><apply id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.cmml" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3"><times id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.1.cmml" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.1"></times><ci id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.2.cmml" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.2">𝑠</ci><ci id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.3.cmml" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.3">𝑜</ci><ci id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.4.cmml" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.4">𝑢</ci><ci id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.5.cmml" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.5">𝑟</ci><ci id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.6.cmml" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.6">𝑐</ci><ci id="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.7.cmml" xref="S3.E13.m2.2.2.1.1.2.2.2.2.2.3.7">𝑒</ci></apply></apply><apply id="S3.E13.m2.1.1.1.cmml" xref="S3.E13.m2.1.1.1"><minus id="S3.E13.m2.1.1.1.2.cmml" xref="S3.E13.m2.1.1.1.2"></minus><apply id="S3.E13.m2.1.1.1.3.cmml" xref="S3.E13.m2.1.1.1.3"><times id="S3.E13.m2.1.1.1.3.1.cmml" xref="S3.E13.m2.1.1.1.3.1"></times><ci id="S3.E13.m2.1.1.1.3.2.cmml" xref="S3.E13.m2.1.1.1.3.2">𝑚</ci><ci id="S3.E13.m2.1.1.1.3.3.cmml" xref="S3.E13.m2.1.1.1.3.3">𝑒</ci><ci id="S3.E13.m2.1.1.1.3.4.cmml" xref="S3.E13.m2.1.1.1.3.4">𝑡</ci><ci id="S3.E13.m2.1.1.1.3.5.cmml" xref="S3.E13.m2.1.1.1.3.5">𝑎</ci></apply><apply id="S3.E13.m2.1.1.1.4.cmml" xref="S3.E13.m2.1.1.1.4"><times id="S3.E13.m2.1.1.1.4.1.cmml" xref="S3.E13.m2.1.1.1.4.1"></times><ci id="S3.E13.m2.1.1.1.4.2.cmml" xref="S3.E13.m2.1.1.1.4.2">𝑡</ci><ci id="S3.E13.m2.1.1.1.4.3.cmml" xref="S3.E13.m2.1.1.1.4.3">𝑟</ci><ci id="S3.E13.m2.1.1.1.4.4.cmml" xref="S3.E13.m2.1.1.1.4.4">𝑎</ci><ci id="S3.E13.m2.1.1.1.4.5.cmml" xref="S3.E13.m2.1.1.1.4.5">𝑖</ci><ci id="S3.E13.m2.1.1.1.4.6.cmml" xref="S3.E13.m2.1.1.1.4.6">𝑛</ci><ci id="S3.E13.m2.1.1.1.1.cmml" xref="S3.E13.m2.1.1.1.1">𝑖</ci></apply></apply></apply></interval></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.E13.m2.2c">\displaystyle=\operatorname*{argmin}_{\theta}\mathcal{L}^{inner}(\theta_{i},% \mathcal{D}_{source}^{meta-train(i)}),</annotation><annotation encoding="application/x-llamapun" id="S3.E13.m2.2d">= roman_argmin start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT caligraphic_L start_POSTSUPERSCRIPT italic_i italic_n italic_n italic_e italic_r end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , caligraphic_D start_POSTSUBSCRIPT italic_s italic_o italic_u italic_r italic_c italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_e italic_t italic_a - italic_t italic_r italic_a italic_i italic_n ( italic_i ) end_POSTSUPERSCRIPT ) ,</annotation></semantics></math></td> <td class="ltx_eqn_cell ltx_eqn_center_padright"></td> <td class="ltx_eqn_cell ltx_eqn_eqno ltx_align_middle ltx_align_right" rowspan="1"><span class="ltx_tag ltx_tag_equation ltx_align_right">(13)</span></td> </tr></tbody> </table> <p class="ltx_p" id="S3.SS3.p1.8"><span class="ltx_text" id="S3.SS3.p1.8.1" style="font-size:90%;">where </span><math alttext="\mathcal{D}" class="ltx_Math" display="inline" id="S3.SS3.p1.1.m1.1"><semantics id="S3.SS3.p1.1.m1.1a"><mi class="ltx_font_mathcaligraphic" id="S3.SS3.p1.1.m1.1.1" mathsize="90%" xref="S3.SS3.p1.1.m1.1.1.cmml">𝒟</mi><annotation-xml encoding="MathML-Content" id="S3.SS3.p1.1.m1.1b"><ci id="S3.SS3.p1.1.m1.1.1.cmml" xref="S3.SS3.p1.1.m1.1.1">𝒟</ci></annotation-xml><annotation encoding="application/x-tex" id="S3.SS3.p1.1.m1.1c">\mathcal{D}</annotation><annotation encoding="application/x-llamapun" id="S3.SS3.p1.1.m1.1d">caligraphic_D</annotation></semantics></math><span class="ltx_text" id="S3.SS3.p1.8.2" style="font-size:90%;"> represents the training dataset from the source domain, which is divided into a meta-train dataset denoted as </span><math alttext="\mathcal{D}_{source}^{meta-train}" class="ltx_Math" display="inline" id="S3.SS3.p1.2.m2.1"><semantics id="S3.SS3.p1.2.m2.1a"><msubsup id="S3.SS3.p1.2.m2.1.1" xref="S3.SS3.p1.2.m2.1.1.cmml"><mi class="ltx_font_mathcaligraphic" id="S3.SS3.p1.2.m2.1.1.2.2" mathsize="90%" xref="S3.SS3.p1.2.m2.1.1.2.2.cmml">𝒟</mi><mrow id="S3.SS3.p1.2.m2.1.1.2.3" xref="S3.SS3.p1.2.m2.1.1.2.3.cmml"><mi id="S3.SS3.p1.2.m2.1.1.2.3.2" mathsize="90%" xref="S3.SS3.p1.2.m2.1.1.2.3.2.cmml">s</mi><mo id="S3.SS3.p1.2.m2.1.1.2.3.1" xref="S3.SS3.p1.2.m2.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS3.p1.2.m2.1.1.2.3.3" mathsize="90%" xref="S3.SS3.p1.2.m2.1.1.2.3.3.cmml">o</mi><mo id="S3.SS3.p1.2.m2.1.1.2.3.1a" xref="S3.SS3.p1.2.m2.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS3.p1.2.m2.1.1.2.3.4" mathsize="90%" xref="S3.SS3.p1.2.m2.1.1.2.3.4.cmml">u</mi><mo id="S3.SS3.p1.2.m2.1.1.2.3.1b" xref="S3.SS3.p1.2.m2.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS3.p1.2.m2.1.1.2.3.5" mathsize="90%" xref="S3.SS3.p1.2.m2.1.1.2.3.5.cmml">r</mi><mo id="S3.SS3.p1.2.m2.1.1.2.3.1c" xref="S3.SS3.p1.2.m2.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS3.p1.2.m2.1.1.2.3.6" mathsize="90%" xref="S3.SS3.p1.2.m2.1.1.2.3.6.cmml">c</mi><mo id="S3.SS3.p1.2.m2.1.1.2.3.1d" xref="S3.SS3.p1.2.m2.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS3.p1.2.m2.1.1.2.3.7" mathsize="90%" xref="S3.SS3.p1.2.m2.1.1.2.3.7.cmml">e</mi></mrow><mrow id="S3.SS3.p1.2.m2.1.1.3" xref="S3.SS3.p1.2.m2.1.1.3.cmml"><mrow id="S3.SS3.p1.2.m2.1.1.3.2" xref="S3.SS3.p1.2.m2.1.1.3.2.cmml"><mi id="S3.SS3.p1.2.m2.1.1.3.2.2" mathsize="90%" xref="S3.SS3.p1.2.m2.1.1.3.2.2.cmml">m</mi><mo id="S3.SS3.p1.2.m2.1.1.3.2.1" xref="S3.SS3.p1.2.m2.1.1.3.2.1.cmml">⁢</mo><mi id="S3.SS3.p1.2.m2.1.1.3.2.3" mathsize="90%" xref="S3.SS3.p1.2.m2.1.1.3.2.3.cmml">e</mi><mo id="S3.SS3.p1.2.m2.1.1.3.2.1a" xref="S3.SS3.p1.2.m2.1.1.3.2.1.cmml">⁢</mo><mi id="S3.SS3.p1.2.m2.1.1.3.2.4" mathsize="90%" xref="S3.SS3.p1.2.m2.1.1.3.2.4.cmml">t</mi><mo id="S3.SS3.p1.2.m2.1.1.3.2.1b" xref="S3.SS3.p1.2.m2.1.1.3.2.1.cmml">⁢</mo><mi id="S3.SS3.p1.2.m2.1.1.3.2.5" mathsize="90%" xref="S3.SS3.p1.2.m2.1.1.3.2.5.cmml">a</mi></mrow><mo id="S3.SS3.p1.2.m2.1.1.3.1" mathsize="90%" xref="S3.SS3.p1.2.m2.1.1.3.1.cmml">−</mo><mrow id="S3.SS3.p1.2.m2.1.1.3.3" xref="S3.SS3.p1.2.m2.1.1.3.3.cmml"><mi id="S3.SS3.p1.2.m2.1.1.3.3.2" mathsize="90%" xref="S3.SS3.p1.2.m2.1.1.3.3.2.cmml">t</mi><mo id="S3.SS3.p1.2.m2.1.1.3.3.1" xref="S3.SS3.p1.2.m2.1.1.3.3.1.cmml">⁢</mo><mi id="S3.SS3.p1.2.m2.1.1.3.3.3" mathsize="90%" xref="S3.SS3.p1.2.m2.1.1.3.3.3.cmml">r</mi><mo id="S3.SS3.p1.2.m2.1.1.3.3.1a" xref="S3.SS3.p1.2.m2.1.1.3.3.1.cmml">⁢</mo><mi id="S3.SS3.p1.2.m2.1.1.3.3.4" mathsize="90%" xref="S3.SS3.p1.2.m2.1.1.3.3.4.cmml">a</mi><mo id="S3.SS3.p1.2.m2.1.1.3.3.1b" xref="S3.SS3.p1.2.m2.1.1.3.3.1.cmml">⁢</mo><mi id="S3.SS3.p1.2.m2.1.1.3.3.5" mathsize="90%" xref="S3.SS3.p1.2.m2.1.1.3.3.5.cmml">i</mi><mo id="S3.SS3.p1.2.m2.1.1.3.3.1c" xref="S3.SS3.p1.2.m2.1.1.3.3.1.cmml">⁢</mo><mi id="S3.SS3.p1.2.m2.1.1.3.3.6" mathsize="90%" xref="S3.SS3.p1.2.m2.1.1.3.3.6.cmml">n</mi></mrow></mrow></msubsup><annotation-xml encoding="MathML-Content" id="S3.SS3.p1.2.m2.1b"><apply id="S3.SS3.p1.2.m2.1.1.cmml" xref="S3.SS3.p1.2.m2.1.1"><csymbol cd="ambiguous" id="S3.SS3.p1.2.m2.1.1.1.cmml" xref="S3.SS3.p1.2.m2.1.1">superscript</csymbol><apply id="S3.SS3.p1.2.m2.1.1.2.cmml" xref="S3.SS3.p1.2.m2.1.1"><csymbol cd="ambiguous" id="S3.SS3.p1.2.m2.1.1.2.1.cmml" xref="S3.SS3.p1.2.m2.1.1">subscript</csymbol><ci id="S3.SS3.p1.2.m2.1.1.2.2.cmml" xref="S3.SS3.p1.2.m2.1.1.2.2">𝒟</ci><apply id="S3.SS3.p1.2.m2.1.1.2.3.cmml" xref="S3.SS3.p1.2.m2.1.1.2.3"><times id="S3.SS3.p1.2.m2.1.1.2.3.1.cmml" xref="S3.SS3.p1.2.m2.1.1.2.3.1"></times><ci id="S3.SS3.p1.2.m2.1.1.2.3.2.cmml" xref="S3.SS3.p1.2.m2.1.1.2.3.2">𝑠</ci><ci id="S3.SS3.p1.2.m2.1.1.2.3.3.cmml" xref="S3.SS3.p1.2.m2.1.1.2.3.3">𝑜</ci><ci id="S3.SS3.p1.2.m2.1.1.2.3.4.cmml" xref="S3.SS3.p1.2.m2.1.1.2.3.4">𝑢</ci><ci id="S3.SS3.p1.2.m2.1.1.2.3.5.cmml" xref="S3.SS3.p1.2.m2.1.1.2.3.5">𝑟</ci><ci id="S3.SS3.p1.2.m2.1.1.2.3.6.cmml" xref="S3.SS3.p1.2.m2.1.1.2.3.6">𝑐</ci><ci id="S3.SS3.p1.2.m2.1.1.2.3.7.cmml" xref="S3.SS3.p1.2.m2.1.1.2.3.7">𝑒</ci></apply></apply><apply id="S3.SS3.p1.2.m2.1.1.3.cmml" xref="S3.SS3.p1.2.m2.1.1.3"><minus id="S3.SS3.p1.2.m2.1.1.3.1.cmml" xref="S3.SS3.p1.2.m2.1.1.3.1"></minus><apply id="S3.SS3.p1.2.m2.1.1.3.2.cmml" xref="S3.SS3.p1.2.m2.1.1.3.2"><times id="S3.SS3.p1.2.m2.1.1.3.2.1.cmml" xref="S3.SS3.p1.2.m2.1.1.3.2.1"></times><ci id="S3.SS3.p1.2.m2.1.1.3.2.2.cmml" xref="S3.SS3.p1.2.m2.1.1.3.2.2">𝑚</ci><ci id="S3.SS3.p1.2.m2.1.1.3.2.3.cmml" xref="S3.SS3.p1.2.m2.1.1.3.2.3">𝑒</ci><ci id="S3.SS3.p1.2.m2.1.1.3.2.4.cmml" xref="S3.SS3.p1.2.m2.1.1.3.2.4">𝑡</ci><ci id="S3.SS3.p1.2.m2.1.1.3.2.5.cmml" xref="S3.SS3.p1.2.m2.1.1.3.2.5">𝑎</ci></apply><apply id="S3.SS3.p1.2.m2.1.1.3.3.cmml" xref="S3.SS3.p1.2.m2.1.1.3.3"><times id="S3.SS3.p1.2.m2.1.1.3.3.1.cmml" xref="S3.SS3.p1.2.m2.1.1.3.3.1"></times><ci id="S3.SS3.p1.2.m2.1.1.3.3.2.cmml" xref="S3.SS3.p1.2.m2.1.1.3.3.2">𝑡</ci><ci id="S3.SS3.p1.2.m2.1.1.3.3.3.cmml" xref="S3.SS3.p1.2.m2.1.1.3.3.3">𝑟</ci><ci id="S3.SS3.p1.2.m2.1.1.3.3.4.cmml" xref="S3.SS3.p1.2.m2.1.1.3.3.4">𝑎</ci><ci id="S3.SS3.p1.2.m2.1.1.3.3.5.cmml" xref="S3.SS3.p1.2.m2.1.1.3.3.5">𝑖</ci><ci id="S3.SS3.p1.2.m2.1.1.3.3.6.cmml" xref="S3.SS3.p1.2.m2.1.1.3.3.6">𝑛</ci></apply></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS3.p1.2.m2.1c">\mathcal{D}_{source}^{meta-train}</annotation><annotation encoding="application/x-llamapun" id="S3.SS3.p1.2.m2.1d">caligraphic_D start_POSTSUBSCRIPT italic_s italic_o italic_u italic_r italic_c italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_e italic_t italic_a - italic_t italic_r italic_a italic_i italic_n end_POSTSUPERSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS3.p1.8.3" style="font-size:90%;"> and a meta-test dataset denoted as </span><math alttext="\mathcal{D}_{source}^{meta-test}" class="ltx_Math" display="inline" id="S3.SS3.p1.3.m3.1"><semantics id="S3.SS3.p1.3.m3.1a"><msubsup id="S3.SS3.p1.3.m3.1.1" xref="S3.SS3.p1.3.m3.1.1.cmml"><mi class="ltx_font_mathcaligraphic" id="S3.SS3.p1.3.m3.1.1.2.2" mathsize="90%" xref="S3.SS3.p1.3.m3.1.1.2.2.cmml">𝒟</mi><mrow id="S3.SS3.p1.3.m3.1.1.2.3" xref="S3.SS3.p1.3.m3.1.1.2.3.cmml"><mi id="S3.SS3.p1.3.m3.1.1.2.3.2" mathsize="90%" xref="S3.SS3.p1.3.m3.1.1.2.3.2.cmml">s</mi><mo id="S3.SS3.p1.3.m3.1.1.2.3.1" xref="S3.SS3.p1.3.m3.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS3.p1.3.m3.1.1.2.3.3" mathsize="90%" xref="S3.SS3.p1.3.m3.1.1.2.3.3.cmml">o</mi><mo id="S3.SS3.p1.3.m3.1.1.2.3.1a" xref="S3.SS3.p1.3.m3.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS3.p1.3.m3.1.1.2.3.4" mathsize="90%" xref="S3.SS3.p1.3.m3.1.1.2.3.4.cmml">u</mi><mo id="S3.SS3.p1.3.m3.1.1.2.3.1b" xref="S3.SS3.p1.3.m3.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS3.p1.3.m3.1.1.2.3.5" mathsize="90%" xref="S3.SS3.p1.3.m3.1.1.2.3.5.cmml">r</mi><mo id="S3.SS3.p1.3.m3.1.1.2.3.1c" xref="S3.SS3.p1.3.m3.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS3.p1.3.m3.1.1.2.3.6" mathsize="90%" xref="S3.SS3.p1.3.m3.1.1.2.3.6.cmml">c</mi><mo id="S3.SS3.p1.3.m3.1.1.2.3.1d" xref="S3.SS3.p1.3.m3.1.1.2.3.1.cmml">⁢</mo><mi id="S3.SS3.p1.3.m3.1.1.2.3.7" mathsize="90%" xref="S3.SS3.p1.3.m3.1.1.2.3.7.cmml">e</mi></mrow><mrow id="S3.SS3.p1.3.m3.1.1.3" xref="S3.SS3.p1.3.m3.1.1.3.cmml"><mrow id="S3.SS3.p1.3.m3.1.1.3.2" xref="S3.SS3.p1.3.m3.1.1.3.2.cmml"><mi id="S3.SS3.p1.3.m3.1.1.3.2.2" mathsize="90%" xref="S3.SS3.p1.3.m3.1.1.3.2.2.cmml">m</mi><mo id="S3.SS3.p1.3.m3.1.1.3.2.1" xref="S3.SS3.p1.3.m3.1.1.3.2.1.cmml">⁢</mo><mi id="S3.SS3.p1.3.m3.1.1.3.2.3" mathsize="90%" xref="S3.SS3.p1.3.m3.1.1.3.2.3.cmml">e</mi><mo id="S3.SS3.p1.3.m3.1.1.3.2.1a" xref="S3.SS3.p1.3.m3.1.1.3.2.1.cmml">⁢</mo><mi id="S3.SS3.p1.3.m3.1.1.3.2.4" mathsize="90%" xref="S3.SS3.p1.3.m3.1.1.3.2.4.cmml">t</mi><mo id="S3.SS3.p1.3.m3.1.1.3.2.1b" xref="S3.SS3.p1.3.m3.1.1.3.2.1.cmml">⁢</mo><mi id="S3.SS3.p1.3.m3.1.1.3.2.5" mathsize="90%" xref="S3.SS3.p1.3.m3.1.1.3.2.5.cmml">a</mi></mrow><mo id="S3.SS3.p1.3.m3.1.1.3.1" mathsize="90%" xref="S3.SS3.p1.3.m3.1.1.3.1.cmml">−</mo><mrow id="S3.SS3.p1.3.m3.1.1.3.3" xref="S3.SS3.p1.3.m3.1.1.3.3.cmml"><mi id="S3.SS3.p1.3.m3.1.1.3.3.2" mathsize="90%" xref="S3.SS3.p1.3.m3.1.1.3.3.2.cmml">t</mi><mo id="S3.SS3.p1.3.m3.1.1.3.3.1" xref="S3.SS3.p1.3.m3.1.1.3.3.1.cmml">⁢</mo><mi id="S3.SS3.p1.3.m3.1.1.3.3.3" mathsize="90%" xref="S3.SS3.p1.3.m3.1.1.3.3.3.cmml">e</mi><mo id="S3.SS3.p1.3.m3.1.1.3.3.1a" xref="S3.SS3.p1.3.m3.1.1.3.3.1.cmml">⁢</mo><mi id="S3.SS3.p1.3.m3.1.1.3.3.4" mathsize="90%" xref="S3.SS3.p1.3.m3.1.1.3.3.4.cmml">s</mi><mo id="S3.SS3.p1.3.m3.1.1.3.3.1b" xref="S3.SS3.p1.3.m3.1.1.3.3.1.cmml">⁢</mo><mi id="S3.SS3.p1.3.m3.1.1.3.3.5" mathsize="90%" xref="S3.SS3.p1.3.m3.1.1.3.3.5.cmml">t</mi></mrow></mrow></msubsup><annotation-xml encoding="MathML-Content" id="S3.SS3.p1.3.m3.1b"><apply id="S3.SS3.p1.3.m3.1.1.cmml" xref="S3.SS3.p1.3.m3.1.1"><csymbol cd="ambiguous" id="S3.SS3.p1.3.m3.1.1.1.cmml" xref="S3.SS3.p1.3.m3.1.1">superscript</csymbol><apply id="S3.SS3.p1.3.m3.1.1.2.cmml" xref="S3.SS3.p1.3.m3.1.1"><csymbol cd="ambiguous" id="S3.SS3.p1.3.m3.1.1.2.1.cmml" xref="S3.SS3.p1.3.m3.1.1">subscript</csymbol><ci id="S3.SS3.p1.3.m3.1.1.2.2.cmml" xref="S3.SS3.p1.3.m3.1.1.2.2">𝒟</ci><apply id="S3.SS3.p1.3.m3.1.1.2.3.cmml" xref="S3.SS3.p1.3.m3.1.1.2.3"><times id="S3.SS3.p1.3.m3.1.1.2.3.1.cmml" xref="S3.SS3.p1.3.m3.1.1.2.3.1"></times><ci id="S3.SS3.p1.3.m3.1.1.2.3.2.cmml" xref="S3.SS3.p1.3.m3.1.1.2.3.2">𝑠</ci><ci id="S3.SS3.p1.3.m3.1.1.2.3.3.cmml" xref="S3.SS3.p1.3.m3.1.1.2.3.3">𝑜</ci><ci id="S3.SS3.p1.3.m3.1.1.2.3.4.cmml" xref="S3.SS3.p1.3.m3.1.1.2.3.4">𝑢</ci><ci id="S3.SS3.p1.3.m3.1.1.2.3.5.cmml" xref="S3.SS3.p1.3.m3.1.1.2.3.5">𝑟</ci><ci id="S3.SS3.p1.3.m3.1.1.2.3.6.cmml" xref="S3.SS3.p1.3.m3.1.1.2.3.6">𝑐</ci><ci id="S3.SS3.p1.3.m3.1.1.2.3.7.cmml" xref="S3.SS3.p1.3.m3.1.1.2.3.7">𝑒</ci></apply></apply><apply id="S3.SS3.p1.3.m3.1.1.3.cmml" xref="S3.SS3.p1.3.m3.1.1.3"><minus id="S3.SS3.p1.3.m3.1.1.3.1.cmml" xref="S3.SS3.p1.3.m3.1.1.3.1"></minus><apply id="S3.SS3.p1.3.m3.1.1.3.2.cmml" xref="S3.SS3.p1.3.m3.1.1.3.2"><times id="S3.SS3.p1.3.m3.1.1.3.2.1.cmml" xref="S3.SS3.p1.3.m3.1.1.3.2.1"></times><ci id="S3.SS3.p1.3.m3.1.1.3.2.2.cmml" xref="S3.SS3.p1.3.m3.1.1.3.2.2">𝑚</ci><ci id="S3.SS3.p1.3.m3.1.1.3.2.3.cmml" xref="S3.SS3.p1.3.m3.1.1.3.2.3">𝑒</ci><ci id="S3.SS3.p1.3.m3.1.1.3.2.4.cmml" xref="S3.SS3.p1.3.m3.1.1.3.2.4">𝑡</ci><ci id="S3.SS3.p1.3.m3.1.1.3.2.5.cmml" xref="S3.SS3.p1.3.m3.1.1.3.2.5">𝑎</ci></apply><apply id="S3.SS3.p1.3.m3.1.1.3.3.cmml" xref="S3.SS3.p1.3.m3.1.1.3.3"><times id="S3.SS3.p1.3.m3.1.1.3.3.1.cmml" xref="S3.SS3.p1.3.m3.1.1.3.3.1"></times><ci id="S3.SS3.p1.3.m3.1.1.3.3.2.cmml" xref="S3.SS3.p1.3.m3.1.1.3.3.2">𝑡</ci><ci id="S3.SS3.p1.3.m3.1.1.3.3.3.cmml" xref="S3.SS3.p1.3.m3.1.1.3.3.3">𝑒</ci><ci id="S3.SS3.p1.3.m3.1.1.3.3.4.cmml" xref="S3.SS3.p1.3.m3.1.1.3.3.4">𝑠</ci><ci id="S3.SS3.p1.3.m3.1.1.3.3.5.cmml" xref="S3.SS3.p1.3.m3.1.1.3.3.5">𝑡</ci></apply></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS3.p1.3.m3.1c">\mathcal{D}_{source}^{meta-test}</annotation><annotation encoding="application/x-llamapun" id="S3.SS3.p1.3.m3.1d">caligraphic_D start_POSTSUBSCRIPT italic_s italic_o italic_u italic_r italic_c italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m italic_e italic_t italic_a - italic_t italic_e italic_s italic_t end_POSTSUPERSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS3.p1.8.4" style="font-size:90%;">. The combination of a meta-train and a meta-test dataset constitutes a meta-task, indexed by </span><math alttext="i" class="ltx_Math" display="inline" id="S3.SS3.p1.4.m4.1"><semantics id="S3.SS3.p1.4.m4.1a"><mi id="S3.SS3.p1.4.m4.1.1" mathsize="90%" xref="S3.SS3.p1.4.m4.1.1.cmml">i</mi><annotation-xml encoding="MathML-Content" id="S3.SS3.p1.4.m4.1b"><ci id="S3.SS3.p1.4.m4.1.1.cmml" xref="S3.SS3.p1.4.m4.1.1">𝑖</ci></annotation-xml><annotation encoding="application/x-tex" id="S3.SS3.p1.4.m4.1c">i</annotation><annotation encoding="application/x-llamapun" id="S3.SS3.p1.4.m4.1d">italic_i</annotation></semantics></math><span class="ltx_text" id="S3.SS3.p1.8.5" style="font-size:90%;">, which means the </span><math alttext="i" class="ltx_Math" display="inline" id="S3.SS3.p1.5.m5.1"><semantics id="S3.SS3.p1.5.m5.1a"><mi id="S3.SS3.p1.5.m5.1.1" mathsize="90%" xref="S3.SS3.p1.5.m5.1.1.cmml">i</mi><annotation-xml encoding="MathML-Content" id="S3.SS3.p1.5.m5.1b"><ci id="S3.SS3.p1.5.m5.1.1.cmml" xref="S3.SS3.p1.5.m5.1.1">𝑖</ci></annotation-xml><annotation encoding="application/x-tex" id="S3.SS3.p1.5.m5.1c">i</annotation><annotation encoding="application/x-llamapun" id="S3.SS3.p1.5.m5.1d">italic_i</annotation></semantics></math><span class="ltx_text" id="S3.SS3.p1.8.6" style="font-size:90%;">-th one in total </span><math alttext="T" class="ltx_Math" display="inline" id="S3.SS3.p1.6.m6.1"><semantics id="S3.SS3.p1.6.m6.1a"><mi id="S3.SS3.p1.6.m6.1.1" mathsize="90%" xref="S3.SS3.p1.6.m6.1.1.cmml">T</mi><annotation-xml encoding="MathML-Content" id="S3.SS3.p1.6.m6.1b"><ci id="S3.SS3.p1.6.m6.1.1.cmml" xref="S3.SS3.p1.6.m6.1.1">𝑇</ci></annotation-xml><annotation encoding="application/x-tex" id="S3.SS3.p1.6.m6.1c">T</annotation><annotation encoding="application/x-llamapun" id="S3.SS3.p1.6.m6.1d">italic_T</annotation></semantics></math><span class="ltx_text" id="S3.SS3.p1.8.7" style="font-size:90%;"> meta-tasks. </span><math alttext="\theta_{i}" class="ltx_Math" display="inline" id="S3.SS3.p1.7.m7.1"><semantics id="S3.SS3.p1.7.m7.1a"><msub id="S3.SS3.p1.7.m7.1.1" xref="S3.SS3.p1.7.m7.1.1.cmml"><mi id="S3.SS3.p1.7.m7.1.1.2" mathsize="90%" xref="S3.SS3.p1.7.m7.1.1.2.cmml">θ</mi><mi id="S3.SS3.p1.7.m7.1.1.3" mathsize="90%" xref="S3.SS3.p1.7.m7.1.1.3.cmml">i</mi></msub><annotation-xml encoding="MathML-Content" id="S3.SS3.p1.7.m7.1b"><apply id="S3.SS3.p1.7.m7.1.1.cmml" xref="S3.SS3.p1.7.m7.1.1"><csymbol cd="ambiguous" id="S3.SS3.p1.7.m7.1.1.1.cmml" xref="S3.SS3.p1.7.m7.1.1">subscript</csymbol><ci id="S3.SS3.p1.7.m7.1.1.2.cmml" xref="S3.SS3.p1.7.m7.1.1.2">𝜃</ci><ci id="S3.SS3.p1.7.m7.1.1.3.cmml" xref="S3.SS3.p1.7.m7.1.1.3">𝑖</ci></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS3.p1.7.m7.1c">\theta_{i}</annotation><annotation encoding="application/x-llamapun" id="S3.SS3.p1.7.m7.1d">italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS3.p1.8.8" style="font-size:90%;"> and </span><math alttext="\theta_{i+1}" class="ltx_Math" display="inline" id="S3.SS3.p1.8.m8.1"><semantics id="S3.SS3.p1.8.m8.1a"><msub id="S3.SS3.p1.8.m8.1.1" xref="S3.SS3.p1.8.m8.1.1.cmml"><mi id="S3.SS3.p1.8.m8.1.1.2" mathsize="90%" xref="S3.SS3.p1.8.m8.1.1.2.cmml">θ</mi><mrow id="S3.SS3.p1.8.m8.1.1.3" xref="S3.SS3.p1.8.m8.1.1.3.cmml"><mi id="S3.SS3.p1.8.m8.1.1.3.2" mathsize="90%" xref="S3.SS3.p1.8.m8.1.1.3.2.cmml">i</mi><mo id="S3.SS3.p1.8.m8.1.1.3.1" mathsize="90%" xref="S3.SS3.p1.8.m8.1.1.3.1.cmml">+</mo><mn id="S3.SS3.p1.8.m8.1.1.3.3" mathsize="90%" xref="S3.SS3.p1.8.m8.1.1.3.3.cmml">1</mn></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.SS3.p1.8.m8.1b"><apply id="S3.SS3.p1.8.m8.1.1.cmml" xref="S3.SS3.p1.8.m8.1.1"><csymbol cd="ambiguous" id="S3.SS3.p1.8.m8.1.1.1.cmml" xref="S3.SS3.p1.8.m8.1.1">subscript</csymbol><ci id="S3.SS3.p1.8.m8.1.1.2.cmml" xref="S3.SS3.p1.8.m8.1.1.2">𝜃</ci><apply id="S3.SS3.p1.8.m8.1.1.3.cmml" xref="S3.SS3.p1.8.m8.1.1.3"><plus id="S3.SS3.p1.8.m8.1.1.3.1.cmml" xref="S3.SS3.p1.8.m8.1.1.3.1"></plus><ci id="S3.SS3.p1.8.m8.1.1.3.2.cmml" xref="S3.SS3.p1.8.m8.1.1.3.2">𝑖</ci><cn id="S3.SS3.p1.8.m8.1.1.3.3.cmml" type="integer" xref="S3.SS3.p1.8.m8.1.1.3.3">1</cn></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS3.p1.8.m8.1c">\theta_{i+1}</annotation><annotation encoding="application/x-llamapun" id="S3.SS3.p1.8.m8.1d">italic_θ start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS3.p1.8.9" style="font-size:90%;"> correspond to the parameters of the inner and outer loops, respectively. Note that there exists a leader-follower asymmetry between the outer and inner loops.</span></p> </div> <figure class="ltx_table" id="S3.T4"> <figcaption class="ltx_caption" style="font-size:90%;"><span class="ltx_tag ltx_tag_table"><span class="ltx_text ltx_font_bold" id="S3.T4.8.1.1">Table 4</span>: </span>Differences in EER between the proposed approach and the baseline. The first row of the table denotes genres of testing utterances, and the first column of the table represents genres of enrollment utterances. The <span class="ltx_text" id="S3.T4.9.2" style="color:#0000FF;">blue</span> number means the proposed approach outperforms the baseline. The <span class="ltx_text" id="S3.T4.10.3" style="color:#FF0000;">red</span> number means the baseline outperforms the proposed approach.</figcaption> <table class="ltx_tabular ltx_centering ltx_guessed_headers ltx_align_middle" id="S3.T4.11"> <thead class="ltx_thead"> <tr class="ltx_tr" id="S3.T4.11.1.1"> <th class="ltx_td ltx_th ltx_th_row ltx_border_tt" id="S3.T4.11.1.1.1" style="padding-left:3.5pt;padding-right:3.5pt;"></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt" id="S3.T4.11.1.1.2" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text ltx_font_bold" id="S3.T4.11.1.1.2.1" style="font-size:90%;">dr</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt" id="S3.T4.11.1.1.3" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text ltx_font_bold" id="S3.T4.11.1.1.3.1" style="font-size:90%;">en</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt" id="S3.T4.11.1.1.4" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text ltx_font_bold" id="S3.T4.11.1.1.4.1" style="font-size:90%;">in</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt" id="S3.T4.11.1.1.5" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text ltx_font_bold" id="S3.T4.11.1.1.5.1" style="font-size:90%;">lb</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt" id="S3.T4.11.1.1.6" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text ltx_font_bold" id="S3.T4.11.1.1.6.1" style="font-size:90%;">re</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt" id="S3.T4.11.1.1.7" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text ltx_font_bold" id="S3.T4.11.1.1.7.1" style="font-size:90%;">si</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt" id="S3.T4.11.1.1.8" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text ltx_font_bold" id="S3.T4.11.1.1.8.1" style="font-size:90%;">sp</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt" id="S3.T4.11.1.1.9" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text ltx_font_bold" id="S3.T4.11.1.1.9.1" style="font-size:90%;">vl</span></th> </tr> </thead> <tbody class="ltx_tbody"> <tr class="ltx_tr" id="S3.T4.11.2.1"> <th class="ltx_td ltx_align_right ltx_th ltx_th_row ltx_border_t" id="S3.T4.11.2.1.1" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.2.1.1.1" style="font-size:90%;">dr</span></th> <td class="ltx_td ltx_align_center ltx_border_t" id="S3.T4.11.2.1.2" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.2.1.2.1" style="font-size:90%;">+1.65</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S3.T4.11.2.1.3" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.2.1.3.1" style="font-size:90%;color:#FF0000;">-1.23</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S3.T4.11.2.1.4" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.2.1.4.1" style="font-size:90%;color:#0000FF;">+1.72</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S3.T4.11.2.1.5" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.2.1.5.1" style="font-size:90%;color:#0000FF;">+5.97</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S3.T4.11.2.1.6" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.2.1.6.1" style="font-size:90%;">-</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S3.T4.11.2.1.7" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.2.1.7.1" style="font-size:90%;">-</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S3.T4.11.2.1.8" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.2.1.8.1" style="font-size:90%;">-</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S3.T4.11.2.1.9" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.2.1.9.1" style="font-size:90%;">-</span></td> </tr> <tr class="ltx_tr" id="S3.T4.11.3.2"> <th class="ltx_td ltx_align_right ltx_th ltx_th_row" id="S3.T4.11.3.2.1" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.3.2.1.1" style="font-size:90%;">en</span></th> <td class="ltx_td ltx_align_center" id="S3.T4.11.3.2.2" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.3.2.2.1" style="font-size:90%;color:#0000FF;">+2.71</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.3.2.3" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.3.2.3.1" style="font-size:90%;">+3.06</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.3.2.4" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.3.2.4.1" style="font-size:90%;color:#FF0000;">-0.02</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.3.2.5" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.3.2.5.1" style="font-size:90%;color:#FF0000;">-1.04</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.3.2.6" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.3.2.6.1" style="font-size:90%;color:#FF0000;">-7.35</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.3.2.7" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.3.2.7.1" style="font-size:90%;color:#0000FF;">+3.83</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.3.2.8" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.3.2.8.1" style="font-size:90%;color:#0000FF;">+1.20</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.3.2.9" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.3.2.9.1" style="font-size:90%;color:#FF0000;">-1.06</span></td> </tr> <tr class="ltx_tr" id="S3.T4.11.4.3"> <th class="ltx_td ltx_align_right ltx_th ltx_th_row" id="S3.T4.11.4.3.1" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.4.3.1.1" style="font-size:90%;">in</span></th> <td class="ltx_td ltx_align_center" id="S3.T4.11.4.3.2" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.4.3.2.1" style="font-size:90%;color:#0000FF;">+4.32</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.4.3.3" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.4.3.3.1" style="font-size:90%;color:#0000FF;">+2.13</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.4.3.4" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.4.3.4.1" style="font-size:90%;">+2.49</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.4.3.5" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.4.3.5.1" style="font-size:90%;color:#FF0000;">-0.64</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.4.3.6" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.4.3.6.1" style="font-size:90%;color:#0000FF;">+2.70</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.4.3.7" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.4.3.7.1" style="font-size:90%;color:#0000FF;">+2.72</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.4.3.8" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.4.3.8.1" style="font-size:90%;color:#0000FF;">+3.94</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.4.3.9" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.4.3.9.1" style="font-size:90%;color:#0000FF;">+14.06</span></td> </tr> <tr class="ltx_tr" id="S3.T4.11.5.4"> <th class="ltx_td ltx_align_right ltx_th ltx_th_row" id="S3.T4.11.5.4.1" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.5.4.1.1" style="font-size:90%;">lb</span></th> <td class="ltx_td ltx_align_center" id="S3.T4.11.5.4.2" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.5.4.2.1" style="font-size:90%;color:#FF0000;">-1.74</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.5.4.3" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.5.4.3.1" style="font-size:90%;color:#FF0000;">-1.13</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.5.4.4" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.5.4.4.1" style="font-size:90%;color:#FF0000;">-2.91</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.5.4.5" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.5.4.5.1" style="font-size:90%;">+0.02</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.5.4.6" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.5.4.6.1" style="font-size:90%;">-</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.5.4.7" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.5.4.7.1" style="font-size:90%;color:#0000FF;">+7.73</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.5.4.8" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.5.4.8.1" style="font-size:90%;">-</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.5.4.9" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.5.4.9.1" style="font-size:90%;color:#0000FF;">+6.71</span></td> </tr> <tr class="ltx_tr" id="S3.T4.11.6.5"> <th class="ltx_td ltx_align_right ltx_th ltx_th_row" id="S3.T4.11.6.5.1" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.6.5.1.1" style="font-size:90%;">sp</span></th> <td class="ltx_td ltx_align_center" id="S3.T4.11.6.5.2" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.6.5.2.1" style="font-size:90%;color:#0000FF;">+2.98</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.6.5.3" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.6.5.3.1" style="font-size:90%;color:#0000FF;">+0.36</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.6.5.4" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.6.5.4.1" style="font-size:90%;color:#0000FF;">+3.34</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.6.5.5" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.6.5.5.1" style="font-size:90%;color:#0000FF;">+5.17</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.6.5.6" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.6.5.6.1" style="font-size:90%;">-</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.6.5.7" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.6.5.7.1" style="font-size:90%;color:#0000FF;">+2.07</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.6.5.8" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.6.5.8.1" style="font-size:90%;">-0.79</span></td> <td class="ltx_td ltx_align_center" id="S3.T4.11.6.5.9" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.6.5.9.1" style="font-size:90%;">-</span></td> </tr> <tr class="ltx_tr" id="S3.T4.11.7.6"> <th class="ltx_td ltx_align_right ltx_th ltx_th_row ltx_border_bb" id="S3.T4.11.7.6.1" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.7.6.1.1" style="font-size:90%;">vl</span></th> <td class="ltx_td ltx_align_center ltx_border_bb" id="S3.T4.11.7.6.2" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.7.6.2.1" style="font-size:90%;color:#FF0000;">-6.52</span></td> <td class="ltx_td ltx_align_center ltx_border_bb" id="S3.T4.11.7.6.3" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.7.6.3.1" style="font-size:90%;color:#0000FF;">+0.51</span></td> <td class="ltx_td ltx_align_center ltx_border_bb" id="S3.T4.11.7.6.4" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.7.6.4.1" style="font-size:90%;color:#0000FF;">+4.75</span></td> <td class="ltx_td ltx_align_center ltx_border_bb" id="S3.T4.11.7.6.5" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.7.6.5.1" style="font-size:90%;color:#0000FF;">+2.63</span></td> <td class="ltx_td ltx_align_center ltx_border_bb" id="S3.T4.11.7.6.6" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.7.6.6.1" style="font-size:90%;">-</span></td> <td class="ltx_td ltx_align_center ltx_border_bb" id="S3.T4.11.7.6.7" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.7.6.7.1" style="font-size:90%;color:#FF0000;">-2.92</span></td> <td class="ltx_td ltx_align_center ltx_border_bb" id="S3.T4.11.7.6.8" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.7.6.8.1" style="font-size:90%;">-</span></td> <td class="ltx_td ltx_align_center ltx_border_bb" id="S3.T4.11.7.6.9" style="padding-left:3.5pt;padding-right:3.5pt;"><span class="ltx_text" id="S3.T4.11.7.6.9.1" style="font-size:90%;">+2.01</span></td> </tr> </tbody> </table> </figure> <figure class="ltx_table" id="S3.T5"> <figcaption class="ltx_caption" style="font-size:80%;"><span class="ltx_tag ltx_tag_table"><span class="ltx_text ltx_font_bold" id="S3.T5.4.1.1">Table 5</span>: </span>Results of the proposed system on CNCeleb.Eval and CNComplex testing datasets. The performance is evaluated under the metrics of SV-EER and SASV-EER metrics, respectively.</figcaption> <table class="ltx_tabular ltx_centering ltx_guessed_headers ltx_align_middle" id="S3.T5.5"> <thead class="ltx_thead"> <tr class="ltx_tr" id="S3.T5.5.1.1"> <th class="ltx_td ltx_align_left ltx_th ltx_th_column ltx_th_row ltx_border_tt" id="S3.T5.5.1.1.1" rowspan="2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text ltx_font_bold" id="S3.T5.5.1.1.1.1" style="font-size:80%;">Protocol</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt" id="S3.T5.5.1.1.2" rowspan="2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text ltx_font_bold" id="S3.T5.5.1.1.2.1" style="font-size:80%;">Training dataset</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt" colspan="2" id="S3.T5.5.1.1.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text ltx_font_bold" id="S3.T5.5.1.1.3.1" style="font-size:80%;">CNCeleb.Eval</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt" colspan="2" id="S3.T5.5.1.1.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text ltx_font_bold" id="S3.T5.5.1.1.4.1" style="font-size:80%;">CNComplex</span></th> </tr> <tr class="ltx_tr" id="S3.T5.5.2.2"> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S3.T5.5.2.2.1" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.2.2.1.1" style="font-size:80%;">SV-EER</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S3.T5.5.2.2.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.2.2.2.1" style="font-size:80%;">SASV-EER</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S3.T5.5.2.2.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.2.2.3.1" style="font-size:80%;">SV-EER</span></th> <th class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S3.T5.5.2.2.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.2.2.4.1" style="font-size:80%;">SASV-EER</span></th> </tr> </thead> <tbody class="ltx_tbody"> <tr class="ltx_tr" id="S3.T5.5.3.1"> <th class="ltx_td ltx_align_left ltx_th ltx_th_row ltx_border_t" id="S3.T5.5.3.1.1" rowspan="2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text ltx_font_bold" id="S3.T5.5.3.1.1.1" style="font-size:80%;">CGP I</span></th> <td class="ltx_td ltx_align_center ltx_border_t" id="S3.T5.5.3.1.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.3.1.2.1" style="font-size:80%;">CNCeleb 1&amp;2</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S3.T5.5.3.1.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.3.1.3.1" style="font-size:80%;">7.96</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S3.T5.5.3.1.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.3.1.4.1" style="font-size:80%;">-</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S3.T5.5.3.1.5" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.3.1.5.1" style="font-size:80%;">8.52</span></td> <td class="ltx_td ltx_align_center ltx_border_t" id="S3.T5.5.3.1.6" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.3.1.6.1" style="font-size:80%;">7.37</span></td> </tr> <tr class="ltx_tr" id="S3.T5.5.4.2"> <td class="ltx_td ltx_align_center" id="S3.T5.5.4.2.1" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.4.2.1.1" style="font-size:80%;">Combination</span></td> <td class="ltx_td ltx_align_center" id="S3.T5.5.4.2.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.4.2.2.1" style="font-size:80%;">7.79</span></td> <td class="ltx_td ltx_align_center" id="S3.T5.5.4.2.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.4.2.3.1" style="font-size:80%;">-</span></td> <td class="ltx_td ltx_align_center" id="S3.T5.5.4.2.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.4.2.4.1" style="font-size:80%;">7.56</span></td> <td class="ltx_td ltx_align_center" id="S3.T5.5.4.2.5" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.4.2.5.1" style="font-size:80%;">7.25</span></td> </tr> <tr class="ltx_tr" id="S3.T5.5.5.3"> <th class="ltx_td ltx_align_left ltx_th ltx_th_row ltx_border_tt" id="S3.T5.5.5.3.1" rowspan="2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text ltx_font_bold" id="S3.T5.5.5.3.1.1" style="font-size:80%;">CGP II</span></th> <td class="ltx_td ltx_align_center ltx_border_tt" id="S3.T5.5.5.3.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.5.3.2.1" style="font-size:80%;">CNCeleb 1&amp;2</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S3.T5.5.5.3.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.5.3.3.1" style="font-size:80%;">8.24</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S3.T5.5.5.3.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.5.3.4.1" style="font-size:80%;">-</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S3.T5.5.5.3.5" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.5.3.5.1" style="font-size:80%;">8.85</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S3.T5.5.5.3.6" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.5.3.6.1" style="font-size:80%;">8.57</span></td> </tr> <tr class="ltx_tr" id="S3.T5.5.6.4"> <td class="ltx_td ltx_align_center" id="S3.T5.5.6.4.1" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.6.4.1.1" style="font-size:80%;">Combination</span></td> <td class="ltx_td ltx_align_center" id="S3.T5.5.6.4.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.6.4.2.1" style="font-size:80%;">7.96</span></td> <td class="ltx_td ltx_align_center" id="S3.T5.5.6.4.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.6.4.3.1" style="font-size:80%;">-</span></td> <td class="ltx_td ltx_align_center" id="S3.T5.5.6.4.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.6.4.4.1" style="font-size:80%;">8.34</span></td> <td class="ltx_td ltx_align_center" id="S3.T5.5.6.4.5" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.6.4.5.1" style="font-size:80%;">8.47</span></td> </tr> <tr class="ltx_tr" id="S3.T5.5.7.5"> <th class="ltx_td ltx_align_left ltx_th ltx_th_row ltx_border_tt" id="S3.T5.5.7.5.1" rowspan="2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text ltx_font_bold" id="S3.T5.5.7.5.1.1" style="font-size:80%;">CGP III</span></th> <td class="ltx_td ltx_align_center ltx_border_tt" id="S3.T5.5.7.5.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.7.5.2.1" style="font-size:80%;">CNCeleb 1&amp;2</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S3.T5.5.7.5.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.7.5.3.1" style="font-size:80%;">8.43</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S3.T5.5.7.5.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.7.5.4.1" style="font-size:80%;">-</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S3.T5.5.7.5.5" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.7.5.5.1" style="font-size:80%;">9.07</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S3.T5.5.7.5.6" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.7.5.6.1" style="font-size:80%;">8.52</span></td> </tr> <tr class="ltx_tr" id="S3.T5.5.8.6"> <td class="ltx_td ltx_align_center" id="S3.T5.5.8.6.1" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.8.6.1.1" style="font-size:80%;">Combination</span></td> <td class="ltx_td ltx_align_center" id="S3.T5.5.8.6.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.8.6.2.1" style="font-size:80%;">8.19</span></td> <td class="ltx_td ltx_align_center" id="S3.T5.5.8.6.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.8.6.3.1" style="font-size:80%;">-</span></td> <td class="ltx_td ltx_align_center" id="S3.T5.5.8.6.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.8.6.4.1" style="font-size:80%;">8.82</span></td> <td class="ltx_td ltx_align_center" id="S3.T5.5.8.6.5" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.8.6.5.1" style="font-size:80%;">8.25</span></td> </tr> <tr class="ltx_tr" id="S3.T5.5.9.7"> <th class="ltx_td ltx_align_left ltx_th ltx_th_row ltx_border_bb ltx_border_tt" id="S3.T5.5.9.7.1" rowspan="2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text ltx_font_bold" id="S3.T5.5.9.7.1.1" style="font-size:80%;">CGP IV</span></th> <td class="ltx_td ltx_align_center ltx_border_tt" id="S3.T5.5.9.7.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.9.7.2.1" style="font-size:80%;">CNCeleb 1&amp;2</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S3.T5.5.9.7.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.9.7.3.1" style="font-size:80%;">8.23</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S3.T5.5.9.7.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.9.7.4.1" style="font-size:80%;">-</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S3.T5.5.9.7.5" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.9.7.5.1" style="font-size:80%;">8.68</span></td> <td class="ltx_td ltx_align_center ltx_border_tt" id="S3.T5.5.9.7.6" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.9.7.6.1" style="font-size:80%;">7.73</span></td> </tr> <tr class="ltx_tr" id="S3.T5.5.10.8"> <td class="ltx_td ltx_align_center ltx_border_bb" id="S3.T5.5.10.8.1" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.10.8.1.1" style="font-size:80%;">Combination</span></td> <td class="ltx_td ltx_align_center ltx_border_bb" id="S3.T5.5.10.8.2" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.10.8.2.1" style="font-size:80%;">8.13</span></td> <td class="ltx_td ltx_align_center ltx_border_bb" id="S3.T5.5.10.8.3" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.10.8.3.1" style="font-size:80%;">-</span></td> <td class="ltx_td ltx_align_center ltx_border_bb" id="S3.T5.5.10.8.4" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.10.8.4.1" style="font-size:80%;">8.45</span></td> <td class="ltx_td ltx_align_center ltx_border_bb" id="S3.T5.5.10.8.5" style="padding-left:2.3pt;padding-right:2.3pt;"><span class="ltx_text" id="S3.T5.5.10.8.5.1" style="font-size:80%;">7.48</span></td> </tr> </tbody> </table> </figure> <figure class="ltx_table" id="S3.T6"> <figcaption class="ltx_caption" style="font-size:90%;"><span class="ltx_tag ltx_tag_table"><span class="ltx_text ltx_font_bold" id="S3.T6.4.1.1">Table 6</span>: </span>EER (%) of experimental results on CNCeleb.Eval testing dataset for the scenario of domain mismatch. For each protocol, the baseline system is established using the proposed model trained through the straightforward supervised learning paradigm. A bold number indicates the highest performance of this genre. The presence of an unseen group is indicated within brackets</figcaption> <p class="ltx_p" id="S3.T6.5"><span class="ltx_text" id="S3.T6.5.1" style="font-size:90%;">. </span> <span class="ltx_tabular ltx_centering ltx_guessed_headers ltx_align_middle" id="S3.T6.5.2"> <span class="ltx_thead"> <span class="ltx_tr" id="S3.T6.5.2.1.1"> <span class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt ltx_rowspan ltx_rowspan_2" id="S3.T6.5.2.1.1.1" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.1.1.1.1" style="font-size:90%;">Protocol</span></span> <span class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt ltx_rowspan ltx_rowspan_2" id="S3.T6.5.2.1.1.2" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.1.1.2.1" style="font-size:90%;">System</span></span> <span class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt ltx_rowspan ltx_rowspan_2" id="S3.T6.5.2.1.1.3" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.1.1.3.1" style="font-size:90%;">Overall</span></span> <span class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt ltx_colspan ltx_colspan_3" id="S3.T6.5.2.1.1.4" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.1.1.4.1" style="font-size:90%;">Group I</span></span> <span class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt ltx_colspan ltx_colspan_3" id="S3.T6.5.2.1.1.5" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.1.1.5.1" style="font-size:90%;">Group II</span></span> <span class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt ltx_colspan ltx_colspan_2" id="S3.T6.5.2.1.1.6" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.1.1.6.1" style="font-size:90%;">Group III</span></span> <span class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_tt ltx_colspan ltx_colspan_2" id="S3.T6.5.2.1.1.7" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.1.1.7.1" style="font-size:90%;">Group IV</span></span></span> <span class="ltx_tr" id="S3.T6.5.2.2.2"> <span class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S3.T6.5.2.2.2.1" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.2.2.1.1" style="font-size:90%;">dr</span></span> <span class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S3.T6.5.2.2.2.2" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.2.2.2.1" style="font-size:90%;">vl</span></span> <span class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S3.T6.5.2.2.2.3" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.2.2.3.1" style="font-size:90%;">sp</span></span> <span class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S3.T6.5.2.2.2.4" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.2.2.4.1" style="font-size:90%;">en</span></span> <span class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S3.T6.5.2.2.2.5" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.2.2.5.1" style="font-size:90%;">in</span></span> <span class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S3.T6.5.2.2.2.6" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.2.2.6.1" style="font-size:90%;">pl</span></span> <span class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S3.T6.5.2.2.2.7" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.2.2.7.1" style="font-size:90%;">lb</span></span> <span class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S3.T6.5.2.2.2.8" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.2.2.8.1" style="font-size:90%;">mo</span></span> <span class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S3.T6.5.2.2.2.9" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.2.2.9.1" style="font-size:90%;">si</span></span> <span class="ltx_td ltx_align_center ltx_th ltx_th_column ltx_border_t" id="S3.T6.5.2.2.2.10" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.2.2.10.1" style="font-size:90%;">re</span></span></span> </span> <span class="ltx_tbody"> <span class="ltx_tr" id="S3.T6.5.2.3.1"> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.3.1.1" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.3.1.1.1" style="font-size:90%;">CGP I</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.3.1.2" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.3.1.2.1" style="font-size:90%;">Baseline</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.3.1.3" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.3.1.3.1" style="font-size:90%;">21.31</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.3.1.4" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.3.1.4.1" style="font-size:90%;">23.15</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.3.1.5" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.3.1.5.1" style="font-size:90%;">20.77</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.3.1.6" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.3.1.6.1" style="font-size:90%;">15.86</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.3.1.7" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.3.1.7.1" style="font-size:90%;">21.61</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.3.1.8" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.3.1.8.1" style="font-size:90%;">22.99</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.3.1.9" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.3.1.9.1" style="font-size:90%;">27.35</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.3.1.10" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.3.1.10.1" style="font-size:90%;">17.50</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.3.1.11" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.3.1.11.1" style="font-size:90%;">29.32</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.3.1.12" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.3.1.12.1" style="font-size:90%;">28.63</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.3.1.13" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.3.1.13.1" style="font-size:90%;">14.58</span></span></span> <span class="ltx_tr" id="S3.T6.5.2.4.2"> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.4.2.1" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.4.2.1.1" style="font-size:90%;">(Group IV)</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.4.2.2" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.4.2.2.1" style="font-size:90%;">proposed approach</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.4.2.3" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.4.2.3.1" style="font-size:90%;">17.42</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.4.2.4" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.4.2.4.1" style="font-size:90%;">22.66</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.4.2.5" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.4.2.5.1" style="font-size:90%;">16.44</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.4.2.6" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.4.2.6.1" style="font-size:90%;">14.46</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.4.2.7" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.4.2.7.1" style="font-size:90%;">20.95</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.4.2.8" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.4.2.8.1" style="font-size:90%;">19.47</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.4.2.9" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.4.2.9.1" style="font-size:90%;">20.93</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.4.2.10" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.4.2.10.1" style="font-size:90%;">16.31</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.4.2.11" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.4.2.11.1" style="font-size:90%;">24.57</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.4.2.12" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.4.2.12.1" style="font-size:90%;">22.33</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.4.2.13" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.4.2.13.1" style="font-size:90%;">13.89</span></span></span> <span class="ltx_tr" id="S3.T6.5.2.5.3"> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.5.3.1" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.5.3.1.1" style="font-size:90%;">CGP II</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.5.3.2" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.5.3.2.1" style="font-size:90%;">Baseline</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.5.3.3" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.5.3.3.1" style="font-size:90%;">22.10</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.5.3.4" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.5.3.4.1" style="font-size:90%;">22.66</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.5.3.5" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.5.3.5.1" style="font-size:90%;">20.64</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.5.3.6" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.5.3.6.1" style="font-size:90%;">16.41</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.5.3.7" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.5.3.7.1" style="font-size:90%;">23.56</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.5.3.8" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.5.3.8.1" style="font-size:90%;">24.08</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.5.3.9" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.5.3.9.1" style="font-size:90%;">27.29</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.5.3.10" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.5.3.10.1" style="font-size:90%;">19.43</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.5.3.11" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.5.3.11.1" style="font-size:90%;">31.55</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.5.3.12" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.5.3.12.1" style="font-size:90%;">25.25</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.5.3.13" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.5.3.13.1" style="font-size:90%;">13.17</span></span></span> <span class="ltx_tr" id="S3.T6.5.2.6.4"> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.6.4.1" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.6.4.1.1" style="font-size:90%;">(Group III)</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.6.4.2" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.6.4.2.1" style="font-size:90%;">proposed approach</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.6.4.3" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.6.4.3.1" style="font-size:90%;">18.48</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.6.4.4" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.6.4.4.1" style="font-size:90%;">18.04</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.6.4.5" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.6.4.5.1" style="font-size:90%;">18.08</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.6.4.6" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.6.4.6.1" style="font-size:90%;">13.84</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.6.4.7" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.6.4.7.1" style="font-size:90%;">19.11</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.6.4.8" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.6.4.8.1" style="font-size:90%;">20.10</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.6.4.9" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.6.4.9.1" style="font-size:90%;">19.71</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.6.4.10" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.6.4.10.1" style="font-size:90%;">17.84</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.6.4.11" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.6.4.11.1" style="font-size:90%;">27.16</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.6.4.12" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.6.4.12.1" style="font-size:90%;">24.16</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.6.4.13" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.6.4.13.1" style="font-size:90%;">13.91</span></span></span> <span class="ltx_tr" id="S3.T6.5.2.7.5"> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.7.5.1" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.7.5.1.1" style="font-size:90%;">CGP III</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.7.5.2" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.7.5.2.1" style="font-size:90%;">Baseline</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.7.5.3" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.7.5.3.1" style="font-size:90%;">23.46</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.7.5.4" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.7.5.4.1" style="font-size:90%;">25.42</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.7.5.5" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.7.5.5.1" style="font-size:90%;">23.89</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.7.5.6" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.7.5.6.1" style="font-size:90%;">17.58</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.7.5.7" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.7.5.7.1" style="font-size:90%;">26.38</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.7.5.8" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.7.5.8.1" style="font-size:90%;">25.34</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.7.5.9" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.7.5.9.1" style="font-size:90%;">29.25</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.7.5.10" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.7.5.10.1" style="font-size:90%;">21.72</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.7.5.11" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.7.5.11.1" style="font-size:90%;">31.14</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.7.5.12" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.7.5.12.1" style="font-size:90%;">27.74</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.7.5.13" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.7.5.13.1" style="font-size:90%;">13.10</span></span></span> <span class="ltx_tr" id="S3.T6.5.2.8.6"> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.8.6.1" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.8.6.1.1" style="font-size:90%;">(Group II)</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.8.6.2" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.8.6.2.1" style="font-size:90%;">proposed approach</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.8.6.3" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.8.6.3.1" style="font-size:90%;">20.02</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.8.6.4" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.8.6.4.1" style="font-size:90%;">23.27</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.8.6.5" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.8.6.5.1" style="font-size:90%;">20.70</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.8.6.6" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.8.6.6.1" style="font-size:90%;">16.79</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.8.6.7" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.8.6.7.1" style="font-size:90%;">22.08</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.8.6.8" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.8.6.8.1" style="font-size:90%;">22.83</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.8.6.9" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.8.6.9.1" style="font-size:90%;">22.96</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.8.6.10" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.8.6.10.1" style="font-size:90%;">19.54</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.8.6.11" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.8.6.11.1" style="font-size:90%;">28.53</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.8.6.12" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.8.6.12.1" style="font-size:90%;">22.92</span></span> <span class="ltx_td ltx_align_center" id="S3.T6.5.2.8.6.13" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.8.6.13.1" style="font-size:90%;">13.55</span></span></span> <span class="ltx_tr" id="S3.T6.5.2.9.7"> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.9.7.1" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.9.7.1.1" style="font-size:90%;">CGP IV</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.9.7.2" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.9.7.2.1" style="font-size:90%;">Baseline</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.9.7.3" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.9.7.3.1" style="font-size:90%;">22.59</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.9.7.4" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.9.7.4.1" style="font-size:90%;">24.13</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.9.7.5" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.9.7.5.1" style="font-size:90%;">23.47</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.9.7.6" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.9.7.6.1" style="font-size:90%;">16.29</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.9.7.7" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.9.7.7.1" style="font-size:90%;">19.50</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.9.7.8" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.9.7.8.1" style="font-size:90%;">22.95</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.9.7.9" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.9.7.9.1" style="font-size:90%;">27.81</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.9.7.10" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.9.7.10.1" style="font-size:90%;">19.99</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.9.7.11" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.9.7.11.1" style="font-size:90%;">28.12</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.9.7.12" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.9.7.12.1" style="font-size:90%;">24.87</span></span> <span class="ltx_td ltx_align_center ltx_border_tt" id="S3.T6.5.2.9.7.13" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.9.7.13.1" style="font-size:90%;">11.33</span></span></span> <span class="ltx_tr" id="S3.T6.5.2.10.8"> <span class="ltx_td ltx_align_center ltx_border_bb" id="S3.T6.5.2.10.8.1" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.10.8.1.1" style="font-size:90%;">(Group I)</span></span> <span class="ltx_td ltx_align_center ltx_border_bb" id="S3.T6.5.2.10.8.2" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.10.8.2.1" style="font-size:90%;">proposed approach</span></span> <span class="ltx_td ltx_align_center ltx_border_bb" id="S3.T6.5.2.10.8.3" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.10.8.3.1" style="font-size:90%;">19.98</span></span> <span class="ltx_td ltx_align_center ltx_border_bb" id="S3.T6.5.2.10.8.4" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.10.8.4.1" style="font-size:90%;">22.35</span></span> <span class="ltx_td ltx_align_center ltx_border_bb" id="S3.T6.5.2.10.8.5" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.10.8.5.1" style="font-size:90%;">19.44</span></span> <span class="ltx_td ltx_align_center ltx_border_bb" id="S3.T6.5.2.10.8.6" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.10.8.6.1" style="font-size:90%;">14.58</span></span> <span class="ltx_td ltx_align_center ltx_border_bb" id="S3.T6.5.2.10.8.7" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text" id="S3.T6.5.2.10.8.7.1" style="font-size:90%;">21.34</span></span> <span class="ltx_td ltx_align_center ltx_border_bb" id="S3.T6.5.2.10.8.8" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.10.8.8.1" style="font-size:90%;">19.62</span></span> <span class="ltx_td ltx_align_center ltx_border_bb" id="S3.T6.5.2.10.8.9" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.10.8.9.1" style="font-size:90%;">20.87</span></span> <span class="ltx_td ltx_align_center ltx_border_bb" id="S3.T6.5.2.10.8.10" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.10.8.10.1" style="font-size:90%;">17.83</span></span> <span class="ltx_td ltx_align_center ltx_border_bb" id="S3.T6.5.2.10.8.11" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.10.8.11.1" style="font-size:90%;">24.24</span></span> <span class="ltx_td ltx_align_center ltx_border_bb" id="S3.T6.5.2.10.8.12" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.10.8.12.1" style="font-size:90%;">23.53</span></span> <span class="ltx_td ltx_align_center ltx_border_bb" id="S3.T6.5.2.10.8.13" style="padding-left:5.7pt;padding-right:5.7pt;"><span class="ltx_text ltx_font_bold" id="S3.T6.5.2.10.8.13.1" style="font-size:90%;">10.55</span></span></span> </span> </span><span class="ltx_text" id="S3.T6.5.3" style="font-size:90%;"></span></p> </figure> <section class="ltx_subsubsection" id="S3.SS3.SSS1"> <h4 class="ltx_title ltx_title_subsubsection" style="font-size:90%;"> <span class="ltx_tag ltx_tag_subsubsection">3.3.1 </span>Outer loop</h4> <div class="ltx_para" id="S3.SS3.SSS1.p1"> <p class="ltx_p" id="S3.SS3.SSS1.p1.2"><span class="ltx_text" id="S3.SS3.SSS1.p1.2.1" style="font-size:90%;">In the outer loop, the genre sampling strategy is employed to simulate the domain mismatch between the meta-train and meta-test data. Specifically, we configure the number of genres in the meta-train dataset (</span><math alttext="G_{mtr}" class="ltx_Math" display="inline" id="S3.SS3.SSS1.p1.1.m1.1"><semantics id="S3.SS3.SSS1.p1.1.m1.1a"><msub id="S3.SS3.SSS1.p1.1.m1.1.1" xref="S3.SS3.SSS1.p1.1.m1.1.1.cmml"><mi id="S3.SS3.SSS1.p1.1.m1.1.1.2" mathsize="90%" xref="S3.SS3.SSS1.p1.1.m1.1.1.2.cmml">G</mi><mrow id="S3.SS3.SSS1.p1.1.m1.1.1.3" xref="S3.SS3.SSS1.p1.1.m1.1.1.3.cmml"><mi id="S3.SS3.SSS1.p1.1.m1.1.1.3.2" mathsize="90%" xref="S3.SS3.SSS1.p1.1.m1.1.1.3.2.cmml">m</mi><mo id="S3.SS3.SSS1.p1.1.m1.1.1.3.1" xref="S3.SS3.SSS1.p1.1.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS3.SSS1.p1.1.m1.1.1.3.3" mathsize="90%" xref="S3.SS3.SSS1.p1.1.m1.1.1.3.3.cmml">t</mi><mo id="S3.SS3.SSS1.p1.1.m1.1.1.3.1a" xref="S3.SS3.SSS1.p1.1.m1.1.1.3.1.cmml">⁢</mo><mi id="S3.SS3.SSS1.p1.1.m1.1.1.3.4" mathsize="90%" xref="S3.SS3.SSS1.p1.1.m1.1.1.3.4.cmml">r</mi></mrow></msub><annotation-xml encoding="MathML-Content" id="S3.SS3.SSS1.p1.1.m1.1b"><apply id="S3.SS3.SSS1.p1.1.m1.1.1.cmml" xref="S3.SS3.SSS1.p1.1.m1.1.1"><csymbol cd="ambiguous" id="S3.SS3.SSS1.p1.1.m1.1.1.1.cmml" xref="S3.SS3.SSS1.p1.1.m1.1.1">subscript</csymbol><ci id="S3.SS3.SSS1.p1.1.m1.1.1.2.cmml" xref="S3.SS3.SSS1.p1.1.m1.1.1.2">𝐺</ci><apply id="S3.SS3.SSS1.p1.1.m1.1.1.3.cmml" xref="S3.SS3.SSS1.p1.1.m1.1.1.3"><times id="S3.SS3.SSS1.p1.1.m1.1.1.3.1.cmml" xref="S3.SS3.SSS1.p1.1.m1.1.1.3.1"></times><ci id="S3.SS3.SSS1.p1.1.m1.1.1.3.2.cmml" xref="S3.SS3.SSS1.p1.1.m1.1.1.3.2">𝑚</ci><ci id="S3.SS3.SSS1.p1.1.m1.1.1.3.3.cmml" xref="S3.SS3.SSS1.p1.1.m1.1.1.3.3">𝑡</ci><ci id="S3.SS3.SSS1.p1.1.m1.1.1.3.4.cmml" xref="S3.SS3.SSS1.p1.1.m1.1.1.3.4">𝑟</ci></apply></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS3.SSS1.p1.1.m1.1c">G_{mtr}</annotation><annotation encoding="application/x-llamapun" id="S3.SS3.SSS1.p1.1.m1.1d">italic_G start_POSTSUBSCRIPT italic_m italic_t italic_r end_POSTSUBSCRIPT</annotation></semantics></math><span class="ltx_text" id="S3.SS3.SSS1.p1.2.2" style="font-size:90%;">) as </span><math alttext="G-2" class="ltx_Math" display="inline" id="S3.SS3.SSS1.p1.2.m2.1"><semantics id="S3.SS3.SSS1.p1.2.m2.1a"><mrow id="S3.SS3.SSS1.p1.2.m2.1.1" xref="S3.SS3.SSS1.p1.2.m2.1.1.cmml"><mi id="S3.SS3.SSS1.p1.2.m2.1.1.2" mathsize="90%" xref="S3.SS3.SSS1.p1.2.m2.1.1.2.cmml">G</mi><mo id="S3.SS3.SSS1.p1.2.m2.1.1.1" mathsize="90%" xref="S3.SS3.SSS1.p1.2.m2.1.1.1.cmml">−</mo><mn id="S3.SS3.SSS1.p1.2.m2.1.1.3" mathsize="90%" xref="S3.SS3.SSS1.p1.2.m2.1.1.3.cmml">2</mn></mrow><annotation-xml encoding="MathML-Content" id="S3.SS3.SSS1.p1.2.m2.1b"><apply id="S3.SS3.SSS1.p1.2.m2.1.1.cmml" xref="S3.SS3.SSS1.p1.2.m2.1.1"><minus id="S3.SS3.SSS1.p1.2.m2.1.1.1.cmml" xref="S3.SS3.SSS1.p1.2.m2.1.1.1"></minus><ci id="S3.SS3.SSS1.p1.2.m2.1.1.2.cmml" xref="S3.SS3.SSS1.p1.2.m2.1.1.2">𝐺</ci><cn id="S3.SS3.SSS1.p1.2.m2.1.1.3.cmml" type="integer" xref="S3.SS3.SSS1.p1.2.m2.1.1.3">2</cn></apply></annotation-xml><annotation encoding="application/x-tex" id="S3.SS3.SSS1.p1.2.m2.1c">G-2</annotation><annotation encoding="application/x-llamapun" id="S3.SS3.SSS1.p1.2.m2.1d">italic_G - 2</annotation></semantics></math><span class="ltx_text" id="S3.SS3.SSS1.p1.2.3" style="font-size:90%;">, with the remaining genre reserved for the meta-test dataset in all experiments in this paper. This approach is consistently applied across different training protocols.</span></p> </div> </section> <section class="ltx_subsubsection" id="S3.SS3.SSS2"> <h4 class="ltx_title ltx_title_subsubsection" style="font-size:90%;"> <span class="ltx_tag ltx_tag_subsubsection">3.3.2 </span>Inner loop</h4> <div class="ltx_para" id="S3.SS3.SSS2.p1"> <p class="ltx_p" id="S3.SS3.SSS2.p1.1"><span class="ltx_text" id="S3.SS3.SSS2.p1.1.1" style="font-size:90%;">Within the inner loop, we employ the training trial sampling method, as outlined in </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S3.SS3.SSS2.p1.1.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib11" title="">11</a>, <a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib27" title="">27</a><span class="ltx_text" id="S3.SS3.SSS2.p1.1.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S3.SS3.SSS2.p1.1.4" style="font-size:90%;">, to simulate both spoofing attacks and channel mismatch scenarios concurrently. Given that the training dataset contains both genuine and copy-synthesized utterances, with each utterance being associated with a specific genre indicating the acoustic environment, it becomes possible to leverage the pair-wise learning paradigm for generating a substantial number of training trials from the training dataset. These training trials effectively simulate various distinct scenarios involving channel mismatch and spoofing attacks, allowing the proposed model to acquire robustness-related knowledge through the inner loop training process.</span></p> </div> </section> </section> </section> <section class="ltx_section" id="S4"> <h2 class="ltx_title ltx_title_section" style="font-size:90%;"> <span class="ltx_tag ltx_tag_section">4 </span>Experimental results</h2> <section class="ltx_subsection" id="S4.SS1"> <h3 class="ltx_title ltx_title_subsection" style="font-size:90%;"> <span class="ltx_tag ltx_tag_subsection">4.1 </span>Dataset</h3> <div class="ltx_para" id="S4.SS1.p1"> <p class="ltx_p" id="S4.SS1.p1.1"><span class="ltx_text" id="S4.SS1.p1.1.1" style="font-size:90%;">In the experiments, we adhere to the cross-genre training protocols outlined in </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S4.SS1.p1.1.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib31" title="">31</a><span class="ltx_text" id="S4.SS1.p1.1.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S4.SS1.p1.1.4" style="font-size:90%;"> to segregate the fusion of the CNSpoof and CNCeleb 1&amp;2 datasets. Under each protocol, we exclude two or three genres, treating them as unseen genres found in the testing dataset. The resulting combination encompasses over 1.2 million utterances spanning eleven distinct genres. Furthermore, the diversity of the training data is enhanced by incorporating the MUSAN </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S4.SS1.p1.1.5.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib47" title="">47</a><span class="ltx_text" id="S4.SS1.p1.1.6.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S4.SS1.p1.1.7" style="font-size:90%;"> and RIRs </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S4.SS1.p1.1.8.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib48" title="">48</a><span class="ltx_text" id="S4.SS1.p1.1.9.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S4.SS1.p1.1.10" style="font-size:90%;"> datasets.</span></p> </div> <div class="ltx_para" id="S4.SS1.p2"> <p class="ltx_p" id="S4.SS1.p2.1"><span class="ltx_text" id="S4.SS1.p2.1.1" style="font-size:90%;">CNCeleb.Eval and CNComplex are both employed as the testing datasets, along with their respective evaluation protocols, to assess the performance of the proposed model. Both datasets feature 196 speakers for enrollment and consist of 17,777 utterances in the testing data, resulting in a total of 3,484,292 testing trials for each evaluation protocol.</span></p> </div> </section> <section class="ltx_subsection" id="S4.SS2"> <h3 class="ltx_title ltx_title_subsection" style="font-size:90%;"> <span class="ltx_tag ltx_tag_subsection">4.2 </span>Experimental settings</h3> <div class="ltx_para" id="S4.SS2.p1"> <p class="ltx_p" id="S4.SS2.p1.2"><span class="ltx_text" id="S4.SS2.p1.2.1" style="font-size:90%;">To facilitate the implementation of the meta-learning paradigm, a subset of 64 samples was randomly selected from the dataset. Within this subset, one genre was chosen randomly to constitute the meta-test dataset for the outer loop, while the remaining samples were utilized as the meta-train dataset for the inner loop. All systems underwent training for 80 epochs using an Adam optimizer </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S4.SS2.p1.2.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib49" title="">49</a><span class="ltx_text" id="S4.SS2.p1.2.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S4.SS2.p1.2.4" style="font-size:90%;">. Training commenced with an initial learning rate of 0.01, </span><math alttext="\beta_{1}" class="ltx_Math" display="inline" id="S4.SS2.p1.1.m1.1"><semantics id="S4.SS2.p1.1.m1.1a"><msub id="S4.SS2.p1.1.m1.1.1" xref="S4.SS2.p1.1.m1.1.1.cmml"><mi id="S4.SS2.p1.1.m1.1.1.2" mathsize="90%" xref="S4.SS2.p1.1.m1.1.1.2.cmml">β</mi><mn id="S4.SS2.p1.1.m1.1.1.3" mathsize="90%" xref="S4.SS2.p1.1.m1.1.1.3.cmml">1</mn></msub><annotation-xml encoding="MathML-Content" id="S4.SS2.p1.1.m1.1b"><apply id="S4.SS2.p1.1.m1.1.1.cmml" xref="S4.SS2.p1.1.m1.1.1"><csymbol cd="ambiguous" id="S4.SS2.p1.1.m1.1.1.1.cmml" xref="S4.SS2.p1.1.m1.1.1">subscript</csymbol><ci id="S4.SS2.p1.1.m1.1.1.2.cmml" xref="S4.SS2.p1.1.m1.1.1.2">𝛽</ci><cn id="S4.SS2.p1.1.m1.1.1.3.cmml" type="integer" xref="S4.SS2.p1.1.m1.1.1.3">1</cn></apply></annotation-xml><annotation encoding="application/x-tex" id="S4.SS2.p1.1.m1.1c">\beta_{1}</annotation><annotation encoding="application/x-llamapun" id="S4.SS2.p1.1.m1.1d">italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT</annotation></semantics></math><span class="ltx_text" id="S4.SS2.p1.2.5" style="font-size:90%;"> of 0.9, and </span><math alttext="\beta_{2}" class="ltx_Math" display="inline" id="S4.SS2.p1.2.m2.1"><semantics id="S4.SS2.p1.2.m2.1a"><msub id="S4.SS2.p1.2.m2.1.1" xref="S4.SS2.p1.2.m2.1.1.cmml"><mi id="S4.SS2.p1.2.m2.1.1.2" mathsize="90%" xref="S4.SS2.p1.2.m2.1.1.2.cmml">β</mi><mn id="S4.SS2.p1.2.m2.1.1.3" mathsize="90%" xref="S4.SS2.p1.2.m2.1.1.3.cmml">2</mn></msub><annotation-xml encoding="MathML-Content" id="S4.SS2.p1.2.m2.1b"><apply id="S4.SS2.p1.2.m2.1.1.cmml" xref="S4.SS2.p1.2.m2.1.1"><csymbol cd="ambiguous" id="S4.SS2.p1.2.m2.1.1.1.cmml" xref="S4.SS2.p1.2.m2.1.1">subscript</csymbol><ci id="S4.SS2.p1.2.m2.1.1.2.cmml" xref="S4.SS2.p1.2.m2.1.1.2">𝛽</ci><cn id="S4.SS2.p1.2.m2.1.1.3.cmml" type="integer" xref="S4.SS2.p1.2.m2.1.1.3">2</cn></apply></annotation-xml><annotation encoding="application/x-tex" id="S4.SS2.p1.2.m2.1c">\beta_{2}</annotation><annotation encoding="application/x-llamapun" id="S4.SS2.p1.2.m2.1d">italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT</annotation></semantics></math><span class="ltx_text" id="S4.SS2.p1.2.6" style="font-size:90%;"> of 0.98. A warm-up strategy was employed, gradually increasing the learning rate from 0 to 0.001 within the first 5,000 steps and subsequently reducing the learning rate using an exponential scheduler.</span></p> </div> </section> <section class="ltx_subsection" id="S4.SS3"> <h3 class="ltx_title ltx_title_subsection" style="font-size:90%;"> <span class="ltx_tag ltx_tag_subsection">4.3 </span>Results and analysis</h3> <div class="ltx_para" id="S4.SS3.p1"> <p class="ltx_p" id="S4.SS3.p1.1"><span class="ltx_text" id="S4.SS3.p1.1.1" style="font-size:90%;">To illustrate the promising outcomes of the proposed model, which integrates a comprehensive learning paradigm, we conduct independent evaluations of the system across various scenarios encompassing channel mismatch, spoofing attacks, and domain mismatch, respectively.</span></p> </div> <section class="ltx_subsubsection" id="S4.SS3.SSS1"> <h4 class="ltx_title ltx_title_subsubsection" style="font-size:90%;"> <span class="ltx_tag ltx_tag_subsubsection">4.3.1 </span>Result for channel mismatch scenario</h4> <div class="ltx_para" id="S4.SS3.SSS1.p1"> <p class="ltx_p" id="S4.SS3.SSS1.p1.1"><span class="ltx_text" id="S4.SS3.SSS1.p1.1.1" style="font-size:90%;">To assess the performance of the proposed approach in scenarios involving channel mismatch, we utilize a protocol based on the CNCeleb.Eval testing dataset similar to the one outlined in </span><cite class="ltx_cite ltx_citemacro_cite"><span class="ltx_text" id="S4.SS3.SSS1.p1.1.2.1" style="font-size:90%;">[</span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#bib.bib11" title="">11</a><span class="ltx_text" id="S4.SS3.SSS1.p1.1.3.2" style="font-size:90%;">]</span></cite><span class="ltx_text" id="S4.SS3.SSS1.p1.1.4" style="font-size:90%;">, which encompasses various mismatch conditions. In this experimental evaluation, the baseline system is established using the proposed model trained through the straightforward supervised learning paradigm without the meta-learning paradigm. The differences in EER between the system and the baseline system within the context of channel mismatch are presented in Table </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.T4" style="font-size:90%;" title="Table 4 ‣ 3.3 Learning paradigm ‣ 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">4</span></a><span class="ltx_text" id="S4.SS3.SSS1.p1.1.5" style="font-size:90%;">. Each cell in the table indicates the EER difference.</span></p> </div> <div class="ltx_para" id="S4.SS3.SSS1.p2"> <p class="ltx_p" id="S4.SS3.SSS1.p2.1"><span class="ltx_text" id="S4.SS3.SSS1.p2.1.1" style="font-size:90%;">The table reveals that the proposed approach exhibits significantly improved performance in scenarios characterized by channel mismatch, which refers to the genre mismatch between the enrollment and testing utterances in the case. Positive differences are notable when testing utterances from the “dr,” “in,” “lb,” “sp,” and “vl” genres are enrolled from the “dr,” “en,” “in,” “sp,” and “vl” genres, respectively. This analysis demonstrates that the proposed approach is particularly effective in mitigating the impact of channel mismatch, emphasizing its capability to adapt to varied mismatch conditions during channel variability scenarios.</span></p> </div> </section> <section class="ltx_subsubsection" id="S4.SS3.SSS2"> <h4 class="ltx_title ltx_title_subsubsection" style="font-size:90%;"> <span class="ltx_tag ltx_tag_subsubsection">4.3.2 </span>Result for spoofing attack scenario</h4> <div class="ltx_para" id="S4.SS3.SSS2.p1"> <p class="ltx_p" id="S4.SS3.SSS2.p1.1"><span class="ltx_text" id="S4.SS3.SSS2.p1.1.1" style="font-size:90%;">Table </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.T5" style="font-size:90%;" title="Table 5 ‣ 3.3 Learning paradigm ‣ 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">5</span></a><span class="ltx_text" id="S4.SS3.SSS2.p1.1.2" style="font-size:90%;"> provides a comprehensive overview of the performance evaluation results for the proposed system considering various training protocols and two separate testing datasets. The evaluation is conducted using the SV-EER and SASV-EER metrics.</span></p> </div> <div class="ltx_para" id="S4.SS3.SSS2.p2"> <p class="ltx_p" id="S4.SS3.SSS2.p2.1"><span class="ltx_text" id="S4.SS3.SSS2.p2.1.1" style="font-size:90%;">First and foremost, comparing the number of SV-EERs on the CNCeleb.Eval dataset of two different training datasets for all training protocols, it is evident that the inclusion of the CNSpoof dataset in the training data consistently improves the performance of the traditional ASV task, aligning with the results presented in Section </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S2.SS2" style="font-size:90%;" title="2.2 Preliminary experimental result of contemporary ASV model ‣ 2 Vulnerability of contemporary ASV model to simultaneous threats ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">2.2</span></a><span class="ltx_text" id="S4.SS3.SSS2.p2.1.2" style="font-size:90%;">. In all training protocols, models trained with the CNSpoof data consistently outperform those trained without it, as evidenced by improvements in all SV-EER numbers.</span></p> </div> <div class="ltx_para" id="S4.SS3.SSS2.p3"> <p class="ltx_p" id="S4.SS3.SSS2.p3.1"><span class="ltx_text" id="S4.SS3.SSS2.p3.1.1" style="font-size:90%;">Furthermore, it is noteworthy that the SASV-EER values for the same system demonstrate comparable results across different protocols when compared to the SV-EER. This observation underscores the system’s capacity for maintaining stability in addressing spoofing attacks, which is a crucial attribute for practical deployment scenarios.</span></p> </div> </section> <section class="ltx_subsubsection" id="S4.SS3.SSS3"> <h4 class="ltx_title ltx_title_subsubsection" style="font-size:90%;"> <span class="ltx_tag ltx_tag_subsubsection">4.3.3 </span>Result for domain mismatch scenario</h4> <div class="ltx_para" id="S4.SS3.SSS3.p1"> <p class="ltx_p" id="S4.SS3.SSS3.p1.1"><span class="ltx_text" id="S4.SS3.SSS3.p1.1.1" style="font-size:90%;">Table </span><a class="ltx_ref" href="https://arxiv.org/html/2409.06327v1#S3.T6" style="font-size:90%;" title="Table 6 ‣ 3.3 Learning paradigm ‣ 3 Proposed integrated approach ‣ Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches"><span class="ltx_text ltx_ref_tag">6</span></a><span class="ltx_text" id="S4.SS3.SSS3.p1.1.2" style="font-size:90%;"> shows the EER results for the scenario of domain mismatch under various protocols. Notably, regardless of the training protocols employed, the proposed model, trained using the integrated learning paradigm, exhibits a substantial overall improvement in performance, as indicated by the third column in the table.</span></p> </div> <div class="ltx_para" id="S4.SS3.SSS3.p2"> <p class="ltx_p" id="S4.SS3.SSS3.p2.1"><span class="ltx_text" id="S4.SS3.SSS3.p2.1.1" style="font-size:90%;">Within each protocol, the presence of an unseen group is indicated within brackets. The proposed approach greatly outperformed in terms of EER in comparison to the baseline system, particularly in the “si” genre of CGP I and the “pl” genre of CGP III. This observation underscores the effectiveness of the proposed model, trained using the integrated learning paradigm, in mitigating the challenges posed by domain mismatch in the context of the ASV task.</span></p> </div> <div class="ltx_para" id="S4.SS3.SSS3.p3"> <p class="ltx_p" id="S4.SS3.SSS3.p3.1"><span class="ltx_text" id="S4.SS3.SSS3.p3.1.1" style="font-size:90%;">Furthermore, the EER numbers for seen genres are also enhanced through the use of the integrated learning paradigm. This suggests that the gradients derived from the meta-test optimization process can effectively serve as a regularization term, correcting the biases inherently introduced by the supervised learning paradigm.</span></p> </div> <div class="ltx_para" id="S4.SS3.SSS3.p4"> <p class="ltx_p" id="S4.SS3.SSS3.p4.1"><span class="ltx_text" id="S4.SS3.SSS3.p4.1.1" style="font-size:90%;">In summary, the results presented in the table highlight that the proposed approach consistently outperforms the baseline system in mitigating domain mismatch, resulting in improved overall EER and EER reductions across various genres in each training protocol.</span></p> </div> </section> </section> </section> <section class="ltx_section" id="S5"> <h2 class="ltx_title ltx_title_section" style="font-size:90%;"> <span class="ltx_tag ltx_tag_section">5 </span>Conclusion</h2> <div class="ltx_para" id="S5.p1"> <p class="ltx_p" id="S5.p1.1"><span class="ltx_text" id="S5.p1.1.1" style="font-size:90%;">In this paper, we proposed an asymmetric dual-path model and an integrated framework unifying pair-wise learning, spoofing simulation, and meta-learning for robustness against channel mismatch, spoofing attacks, and domain shifts. The bilevel optimization structure makes it possible to these multifaceted issues concurrently in an end-to-end model.</span></p> </div> <div class="ltx_para" id="S5.p2"> <p class="ltx_p" id="S5.p2.1"><span class="ltx_text" id="S5.p2.1.1" style="font-size:90%;">To demonstrate the advantages of the proposed model and learning paradigm, a series of experiments were conducted to show the performance improvement independently under scenarios of channel mismatch, spoofing attacks, and domain mismatch, respectively. The experiments demonstrated consistent improvements over baseline systems trained in a supervised learning paradigm. The proposed approach provides a blueprint for developing production ASV systems that are dependable under diverse channel mismatch, spoofing attacks, and domain mismatch.</span></p> </div> </section> <section class="ltx_section" id="S6"> <h2 class="ltx_title ltx_title_section" style="font-size:90%;"> <span class="ltx_tag ltx_tag_section">6 </span>ACKNOWLEDGMENTS</h2> <div class="ltx_para" id="S6.p1"> <p class="ltx_p" id="S6.p1.1"><span class="ltx_text" id="S6.p1.1.1" style="font-size:90%;">This study is partially supported by JST CREST Grants (JPMJCR18A6, JPMJCR20D3) and JST AIP Acceleration Research (JPMJCR24U3). This research was also partially funded by the project to develop and demonstrate countermeasures against disinformation and misinformation on the Internet with the Ministry of Internal Affairs and Communications of Japan.</span></p> </div> </section> <section class="ltx_bibliography" id="bib"> <h2 class="ltx_title ltx_title_bibliography" style="font-size:90%;">References</h2> <ul class="ltx_biblist"> <li class="ltx_bibitem" id="bib.bib1"> <span class="ltx_tag ltx_tag_bibitem">[1]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib1.1.1" style="font-size:90%;"> Douglas A Reynolds, Thomas F Quatieri, and Robert B Dunn, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib1.2.1" style="font-size:90%;">“Speaker Verification Using Adapted Gaussian Mixture Models,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text ltx_font_italic" id="bib.bib1.3.1" style="font-size:90%;">Digital Signal Processing</span><span class="ltx_text" id="bib.bib1.4.2" style="font-size:90%;">, vol. 10, no. 1-3, pp. 19–41, 2000. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib2"> <span class="ltx_tag ltx_tag_bibitem">[2]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib2.1.1" style="font-size:90%;"> Najim Dehak, Patrick J Kenny, Réda Dehak, Pierre Dumouchel, and Pierre Ouellet, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib2.2.1" style="font-size:90%;">“Front-end factor analysis for speaker verification,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text ltx_font_italic" id="bib.bib2.3.1" style="font-size:90%;">IEEE Transactions on Audio, Speech, and Language Processing</span><span class="ltx_text" id="bib.bib2.4.2" style="font-size:90%;">, vol. 19, no. 4, pp. 788–798, 2010. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib3"> <span class="ltx_tag ltx_tag_bibitem">[3]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib3.1.1" style="font-size:90%;"> David Snyder, Daniel Garcia-Romero, Daniel Povey, and Sanjeev Khudanpur, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib3.2.1" style="font-size:90%;">“Deep Neural Network Embeddings for Text-Independent Speaker Verification.,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib3.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib3.4.2" style="font-size:90%;">Proc. Interspeech</span><span class="ltx_text" id="bib.bib3.5.3" style="font-size:90%;">, 2017, pp. 999–1003. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib4"> <span class="ltx_tag ltx_tag_bibitem">[4]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib4.1.1" style="font-size:90%;"> Joon Son Chung, Jaesung Huh, Seongkyu Mun, Minjae Lee, Hee-Soo Heo, Soyeon Choe, Chiheon Ham, Sunghwan Jung, Bong-Jin Lee, and Icksang Han, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib4.2.1" style="font-size:90%;">“In Defence of Metric Learning for Speaker Recognition,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib4.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib4.4.2" style="font-size:90%;">Proc. Interspeech</span><span class="ltx_text" id="bib.bib4.5.3" style="font-size:90%;">, 2020, pp. 2977–2981. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib5"> <span class="ltx_tag ltx_tag_bibitem">[5]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib5.1.1" style="font-size:90%;"> Kevin J Lang, Alex H Waibel, and Geoffrey E Hinton, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib5.2.1" style="font-size:90%;">“A Time-Delay Neural Network Architecture for Isolated Word Recognition,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text ltx_font_italic" id="bib.bib5.3.1" style="font-size:90%;">Neural networks</span><span class="ltx_text" id="bib.bib5.4.2" style="font-size:90%;">, vol. 3, no. 1, pp. 23–43, 1990. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib6"> <span class="ltx_tag ltx_tag_bibitem">[6]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib6.1.1" style="font-size:90%;"> Daniel Povey, Gaofeng Cheng, Yiming Wang, Ke Li, Hainan Xu, Mahsa Arab Yarmohammadi, and Sanjeev Khudanpur, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib6.2.1" style="font-size:90%;">“Semi-orthogonal low-rank matrix factorization for deep neural networks,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib6.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib6.4.2" style="font-size:90%;">Interspeech</span><span class="ltx_text" id="bib.bib6.5.3" style="font-size:90%;">, 2018. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib7"> <span class="ltx_tag ltx_tag_bibitem">[7]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib7.1.1" style="font-size:90%;"> Brecht Desplanques, Jenthe Thienpondt, and Kris Demuynck, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib7.2.1" style="font-size:90%;">“ECAPA-TDNN: Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib7.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib7.4.2" style="font-size:90%;">Proc. Interspeech</span><span class="ltx_text" id="bib.bib7.5.3" style="font-size:90%;">, 2020, pp. 3830–3834. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib8"> <span class="ltx_tag ltx_tag_bibitem">[8]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib8.1.1" style="font-size:90%;"> Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib8.2.1" style="font-size:90%;">“Deep Residual Learning for Image Recognition,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib8.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib8.4.2" style="font-size:90%;">Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition</span><span class="ltx_text" id="bib.bib8.5.3" style="font-size:90%;">, 2016, pp. 770–778. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib9"> <span class="ltx_tag ltx_tag_bibitem">[9]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib9.1.1" style="font-size:90%;"> Hee Soo Heo, Bong-Jin Lee, Jaesung Huh, and Joon Son Chung, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib9.2.1" style="font-size:90%;">“Clova Baseline System for the VoxCeleb Speaker Recognition Challenge 2020,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text ltx_font_italic" id="bib.bib9.3.1" style="font-size:90%;">arXiv preprint arXiv:2009.14153</span><span class="ltx_text" id="bib.bib9.4.2" style="font-size:90%;">, 2020. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib10"> <span class="ltx_tag ltx_tag_bibitem">[10]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib10.1.1" style="font-size:90%;"> Shreyas Ramoji, Prashant Krishnan, and Sriram Ganapathy, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib10.2.1" style="font-size:90%;">“NPLDA: A Deep Neural PLDA Model for Speaker Verification,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib10.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib10.4.2" style="font-size:90%;">Proc. The Speaker and Language Recognition Workshop (Odyssey)</span><span class="ltx_text" id="bib.bib10.5.3" style="font-size:90%;">, 2020, pp. 202–209. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib11"> <span class="ltx_tag ltx_tag_bibitem">[11]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib11.1.1" style="font-size:90%;"> Chang Zeng, Xin Wang, Erica Cooper, Xiaoxiao Miao, and Junichi Yamagishi, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib11.2.1" style="font-size:90%;">“Attention Back-End for Automatic Speaker Verification with Multiple Enrollment Utterances,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib11.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib11.4.2" style="font-size:90%;">ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</span><span class="ltx_text" id="bib.bib11.5.3" style="font-size:90%;">. IEEE, 2022, pp. 6717–6721. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib12"> <span class="ltx_tag ltx_tag_bibitem">[12]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib12.1.1" style="font-size:90%;"> Chang Zeng, Xiaoxiao Miao, Xin Wang, Erica Cooper, and Junichi Yamagishi, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib12.2.1" style="font-size:90%;">“Joint speaker encoder and neural back-end model for fully end-to-end automatic speaker verification with multiple enrollment utterances,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text ltx_font_italic" id="bib.bib12.3.1" style="font-size:90%;">Computer Speech &amp; Language</span><span class="ltx_text" id="bib.bib12.4.2" style="font-size:90%;">, vol. 86, pp. 101619, 2024. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib13"> <span class="ltx_tag ltx_tag_bibitem">[13]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib13.1.1" style="font-size:90%;"> Victor Zue, Stephanie Seneff, and James Glass, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib13.2.1" style="font-size:90%;">“Speech database development at mit: Timit and beyond,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text ltx_font_italic" id="bib.bib13.3.1" style="font-size:90%;">Speech communication</span><span class="ltx_text" id="bib.bib13.4.2" style="font-size:90%;">, vol. 9, no. 4, pp. 351–356, 1990. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib14"> <span class="ltx_tag ltx_tag_bibitem">[14]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib14.1.1" style="font-size:90%;"> Vassil Panayotov, Guoguo Chen, Daniel Povey, and Sanjeev Khudanpur, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib14.2.1" style="font-size:90%;">“Librispeech: an asr corpus based on public domain audio books,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib14.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib14.4.2" style="font-size:90%;">2015 IEEE international conference on acoustics, speech and signal processing (ICASSP)</span><span class="ltx_text" id="bib.bib14.5.3" style="font-size:90%;">. IEEE, 2015, pp. 5206–5210. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib15"> <span class="ltx_tag ltx_tag_bibitem">[15]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib15.1.1" style="font-size:90%;"> Arsha Nagrani, Joon Son Chung, and Andrew Zisserman, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib15.2.1" style="font-size:90%;">“VoxCeleb: A Large-Scale Speaker Identification Dataset,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib15.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib15.4.2" style="font-size:90%;">Proc. Interspeech</span><span class="ltx_text" id="bib.bib15.5.3" style="font-size:90%;">, 2017, pp. 2616–2620. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib16"> <span class="ltx_tag ltx_tag_bibitem">[16]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib16.1.1" style="font-size:90%;"> Mitchell McLaren, Luciana Ferrer, Diego Castan, and Aaron Lawson, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib16.2.1" style="font-size:90%;">“The speakers in the wild (sitw) speaker recognition database.,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib16.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib16.4.2" style="font-size:90%;">Interspeech</span><span class="ltx_text" id="bib.bib16.5.3" style="font-size:90%;">, 2016, pp. 818–822. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib17"> <span class="ltx_tag ltx_tag_bibitem">[17]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib17.1.1" style="font-size:90%;"> Yue Fan, JW Kang, LT Li, KC Li, HL Chen, ST Cheng, PY Zhang, ZY Zhou, YQ Cai, and Dong Wang, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib17.2.1" style="font-size:90%;">“CN-Celeb: A Challenging Chinese Speaker Recognition Dataset,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib17.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib17.4.2" style="font-size:90%;">ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</span><span class="ltx_text" id="bib.bib17.5.3" style="font-size:90%;">. IEEE, 2020, pp. 7604–7608. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib18"> <span class="ltx_tag ltx_tag_bibitem">[18]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib18.1.1" style="font-size:90%;"> Sandro Cumani and Pietro Laface, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib18.2.1" style="font-size:90%;">“Joint Estimation of PLDA and Nonlinear Transformations of Speaker Vectors,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text ltx_font_italic" id="bib.bib18.3.1" style="font-size:90%;">IEEE/ACM Transactions on Audio, Speech, and Language Processing</span><span class="ltx_text" id="bib.bib18.4.2" style="font-size:90%;">, vol. 25, pp. 1890–1900, 2017. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib19"> <span class="ltx_tag ltx_tag_bibitem">[19]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib19.1.1" style="font-size:90%;"> Sergey Ioffe, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib19.2.1" style="font-size:90%;">“Probabilistic Linear Discriminant Analysis,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib19.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib19.4.2" style="font-size:90%;">European Conference on Computer Vision</span><span class="ltx_text" id="bib.bib19.5.3" style="font-size:90%;">. Springer, 2006, pp. 531–542. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib20"> <span class="ltx_tag ltx_tag_bibitem">[20]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib20.1.1" style="font-size:90%;"> Alan McCree, Gregory Sell, and Daniel Garcia-Romero, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib20.2.1" style="font-size:90%;">“Speaker Diarization Using Leave-One-Out Gaussian PLDA Clustering of DNN Embeddings,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib20.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib20.4.2" style="font-size:90%;">Proc. Interspeech 2019</span><span class="ltx_text" id="bib.bib20.5.3" style="font-size:90%;">, 2019, pp. 381–385. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib21"> <span class="ltx_tag ltx_tag_bibitem">[21]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib21.1.1" style="font-size:90%;"> Xinfeng Li, Xiaoyu Ji, Chen Yan, Chaohao Li, Yichen Li, Zhenning Zhang, and Wenyuan Xu, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib21.2.1" style="font-size:90%;">“Learning normality is enough: A software-based mitigation against inaudible voice attacks,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib21.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib21.4.2" style="font-size:90%;">32nd USENIX Security Symposium (USENIX Security 23)</span><span class="ltx_text" id="bib.bib21.5.3" style="font-size:90%;">, 2023, pp. 2455–2472. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib22"> <span class="ltx_tag ltx_tag_bibitem">[22]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib22.1.1" style="font-size:90%;"> Massimiliano Todisco, Xin Wang, Ville Vestman, Md Sahidullah, Héctor Delgado, Andreas Nautsch, Junichi Yamagishi, Nicholas Evans, Tomi Kinnunen, and Kong Aik Lee, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib22.2.1" style="font-size:90%;">“Asvspoof 2019: Future horizons in spoofed and fake audio detection,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text ltx_font_italic" id="bib.bib22.3.1" style="font-size:90%;">arXiv preprint arXiv:1904.05441</span><span class="ltx_text" id="bib.bib22.4.2" style="font-size:90%;">, 2019. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib23"> <span class="ltx_tag ltx_tag_bibitem">[23]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib23.1.1" style="font-size:90%;"> Junichi Yamagishi, Xin Wang, Massimiliano Todisco, Md Sahidullah, Jose Patino, Andreas Nautsch, Xuechen Liu, Kong Aik Lee, Tomi Kinnunen, Nicholas Evans, et al., </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib23.2.1" style="font-size:90%;">“Asvspoof 2021: accelerating progress in spoofed and deepfake speech detection,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text ltx_font_italic" id="bib.bib23.3.1" style="font-size:90%;">arXiv preprint arXiv:2109.00537</span><span class="ltx_text" id="bib.bib23.4.2" style="font-size:90%;">, 2021. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib24"> <span class="ltx_tag ltx_tag_bibitem">[24]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib24.1.1" style="font-size:90%;"> Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Ye Bai, Cunhang Fan, et al., </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib24.2.1" style="font-size:90%;">“Add 2022: the first audio deep synthesis detection challenge,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib24.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib24.4.2" style="font-size:90%;">ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</span><span class="ltx_text" id="bib.bib24.5.3" style="font-size:90%;">. IEEE, 2022, pp. 9216–9220. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib25"> <span class="ltx_tag ltx_tag_bibitem">[25]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib25.1.1" style="font-size:90%;"> Hanyi Zhang, Longbiao Wang, Kong Aik Lee, Meng Liu, Jianwu Dang, and Hui Chen, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib25.2.1" style="font-size:90%;">“Learning domain-invariant transformation for speaker verification,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib25.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib25.4.2" style="font-size:90%;">ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</span><span class="ltx_text" id="bib.bib25.5.3" style="font-size:90%;">. IEEE, 2022, pp. 7177–7181. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib26"> <span class="ltx_tag ltx_tag_bibitem">[26]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib26.1.1" style="font-size:90%;"> Patrick Kenny and Pierre Dumouchel, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib26.2.1" style="font-size:90%;">“Disentangling speaker and channel effects in speaker verification,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib26.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib26.4.2" style="font-size:90%;">2004 IEEE International Conference on Acoustics, Speech, and Signal Processing</span><span class="ltx_text" id="bib.bib26.5.3" style="font-size:90%;">. IEEE, 2004, vol. 1, pp. I–37. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib27"> <span class="ltx_tag ltx_tag_bibitem">[27]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib27.1.1" style="font-size:90%;"> Chang Zeng, Lin Zhang, Meng Liu, and Junichi Yamagishi, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib27.2.1" style="font-size:90%;">“Spoofing-Aware Attention based ASV Back-end with Multiple Enrollment Utterances and a Sampling Strategy for the SASV Challenge 2022,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib27.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib27.4.2" style="font-size:90%;">Proc. Interspeech 2022</span><span class="ltx_text" id="bib.bib27.5.3" style="font-size:90%;">, 2022, pp. 2883–2887. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib28"> <span class="ltx_tag ltx_tag_bibitem">[28]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib28.1.1" style="font-size:90%;"> Jiawen Kang, Ruiqi Liu, Lantian Li, Yunqi Cai, Dong Wang, and Thomas Fang Zheng, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib28.2.1" style="font-size:90%;">“Domain-invariant speaker vector projection by model-agnostic meta-learning,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib28.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib28.4.2" style="font-size:90%;">Interspeech</span><span class="ltx_text" id="bib.bib28.5.3" style="font-size:90%;">, 2020. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib29"> <span class="ltx_tag ltx_tag_bibitem">[29]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib29.1.1" style="font-size:90%;"> Joon Son Chung, Arsha Nagrani, and Andrew Zisserman, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib29.2.1" style="font-size:90%;">“VoxCeleb2: Deep Speaker Recognition,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib29.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib29.4.2" style="font-size:90%;">Proc. Interspeech</span><span class="ltx_text" id="bib.bib29.5.3" style="font-size:90%;">, 2018, pp. 1086–1090. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib30"> <span class="ltx_tag ltx_tag_bibitem">[30]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib30.1.1" style="font-size:90%;"> Lantian Li, Ruiqi Liu, Jiawen Kang, Yue Fan, Hao Cui, Yunqi Cai, Ravichander Vipperla, Thomas Fang Zheng, and Dong Wang, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib30.2.1" style="font-size:90%;">“CN-Celeb: Multi-genre Speaker Recognition,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text ltx_font_italic" id="bib.bib30.3.1" style="font-size:90%;">Speech Communication</span><span class="ltx_text" id="bib.bib30.4.2" style="font-size:90%;">, 2022. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib31"> <span class="ltx_tag ltx_tag_bibitem">[31]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib31.1.1" style="font-size:90%;"> Chang Zeng, Xin Wang, Xiaoxiao Miao, Erica Cooper, and Junichi Yamagishi, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib31.2.1" style="font-size:90%;">“Improving Generalization Ability of Countermeasures for New Mismatch Scenario by Combining Multiple Advanced Regularization Terms,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib31.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib31.4.2" style="font-size:90%;">Proc. INTERSPEECH 2023</span><span class="ltx_text" id="bib.bib31.5.3" style="font-size:90%;">, 2023, pp. 1998–2002. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib32"> <span class="ltx_tag ltx_tag_bibitem">[32]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib32.1.1" style="font-size:90%;"> Jee weon Jung, Hemlata Tak, Hye jin Shim, Hee-Soo Heo, Bong-Jin Lee, Soo-Whan Chung, Ha-Jin Yu, Nicholas Evans, and Tomi Kinnunen, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib32.2.1" style="font-size:90%;">“SASV 2022: The First Spoofing-Aware Speaker Verification Challenge,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib32.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib32.4.2" style="font-size:90%;">Proc. Interspeech 2022</span><span class="ltx_text" id="bib.bib32.5.3" style="font-size:90%;">, 2022, pp. 2893–2897. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib33"> <span class="ltx_tag ltx_tag_bibitem">[33]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib33.1.1" style="font-size:90%;"> Hao Wang, Yitong Wang, Zheng Zhou, Xing Ji, Dihong Gong, Jingchao Zhou, Zhifeng Li, and Wei Liu, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib33.2.1" style="font-size:90%;">“CosFace: Large Margin Cosine Loss for Deep Face Recognition,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib33.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib33.4.2" style="font-size:90%;">Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition</span><span class="ltx_text" id="bib.bib33.5.3" style="font-size:90%;">, 2018, pp. 5265–5274. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib34"> <span class="ltx_tag ltx_tag_bibitem">[34]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib34.1.1" style="font-size:90%;"> Feng Wang, Jian Cheng, Weiyang Liu, and Haijun Liu, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib34.2.1" style="font-size:90%;">“Additive Margin Softmax for Face Verification,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text ltx_font_italic" id="bib.bib34.3.1" style="font-size:90%;">IEEE Signal Processing Letters</span><span class="ltx_text" id="bib.bib34.4.2" style="font-size:90%;">, vol. 25, no. 7, pp. 926–930, 2018. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib35"> <span class="ltx_tag ltx_tag_bibitem">[35]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib35.1.1" style="font-size:90%;"> Wendy J Holmes, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib35.2.1" style="font-size:90%;">“Copy synthesis of female speech using the jsru parallel formant synthesiser.,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib35.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib35.4.2" style="font-size:90%;">EUROSPEECH</span><span class="ltx_text" id="bib.bib35.5.3" style="font-size:90%;">, 1989, pp. 2513–2516. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib36"> <span class="ltx_tag ltx_tag_bibitem">[36]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib36.1.1" style="font-size:90%;"> Monisankha Pal, Dipjyoti Paul, and Goutam Saha, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib36.2.1" style="font-size:90%;">“Synthetic speech detection using fundamental frequency variation and spectral features,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text ltx_font_italic" id="bib.bib36.3.1" style="font-size:90%;">Computer Speech &amp; Language</span><span class="ltx_text" id="bib.bib36.4.2" style="font-size:90%;">, vol. 48, pp. 31–50, 2018. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib37"> <span class="ltx_tag ltx_tag_bibitem">[37]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib37.1.1" style="font-size:90%;"> Xiaohui Liu, Meng Liu, Longbiao Wang, Kong Aik Lee, Hanyi Zhang, and Jianwu Dang, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib37.2.1" style="font-size:90%;">“Leveraging Positional-Related Local-Global Dependency for Synthetic Speech Detection,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib37.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib37.4.2" style="font-size:90%;">ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</span><span class="ltx_text" id="bib.bib37.5.3" style="font-size:90%;">, 2023, pp. 1–5. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib38"> <span class="ltx_tag ltx_tag_bibitem">[38]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib38.1.1" style="font-size:90%;"> Xinfeng Li, Kai Li, Yifan Zheng, Chen Yan, Xiaoyu Ji, and Wenyuan Xu, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib38.2.1" style="font-size:90%;">“Safeear: Content privacy-preserving audio deepfake detection,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib38.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib38.4.2" style="font-size:90%;">Proceedings of the 2024 ACM SIGSAC Conference on Computer and Communications Security (CCS)</span><span class="ltx_text" id="bib.bib38.5.3" style="font-size:90%;">, 2024. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib39"> <span class="ltx_tag ltx_tag_bibitem">[39]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib39.1.1" style="font-size:90%;"> Kai Li, Fenghua Xie, Hang Chen, Kexin Yuan, and Xiaolin Hu, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib39.2.1" style="font-size:90%;">“An audio-visual speech separation model inspired by cortico-thalamo-cortical circuits,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text ltx_font_italic" id="bib.bib39.3.1" style="font-size:90%;">IEEE Transactions on Pattern Analysis and Machine Intelligence</span><span class="ltx_text" id="bib.bib39.4.2" style="font-size:90%;">, 2024. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib40"> <span class="ltx_tag ltx_tag_bibitem">[40]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib40.1.1" style="font-size:90%;"> Kai Li, Runxuan Yang, Fuchun Sun, and Xiaolin Hu, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib40.2.1" style="font-size:90%;">“Iianet: An intra-and inter-modality attention network for audio-visual speech separation,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib40.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib40.4.2" style="font-size:90%;">Forty-first International Conference on Machine Learning</span><span class="ltx_text" id="bib.bib40.5.3" style="font-size:90%;">, 2024. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib41"> <span class="ltx_tag ltx_tag_bibitem">[41]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib41.1.1" style="font-size:90%;"> Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, </span><span class="ltx_text ltx_font_caligraphic" id="bib.bib41.2.2" style="font-size:90%;">L</span><span class="ltx_text" id="bib.bib41.3.3" style="font-size:90%;">ukasz Kaiser, and Illia Polosukhin, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib41.4.1" style="font-size:90%;">“Attention is All you Need,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib41.5.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib41.6.2" style="font-size:90%;">Advances in Neural Information Processing Systems</span><span class="ltx_text" id="bib.bib41.7.3" style="font-size:90%;">, 2017, pp. 5998–6008. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib42"> <span class="ltx_tag ltx_tag_bibitem">[42]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib42.1.1" style="font-size:90%;"> Yi Ren, Yangjun Ruan, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, and Tie-Yan Liu, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib42.2.1" style="font-size:90%;">“Fastspeech: Fast, robust and controllable text to speech,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text ltx_font_italic" id="bib.bib42.3.1" style="font-size:90%;">Advances in Neural Information Processing Systems</span><span class="ltx_text" id="bib.bib42.4.2" style="font-size:90%;">, vol. 32, 2019. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib43"> <span class="ltx_tag ltx_tag_bibitem">[43]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib43.1.1" style="font-size:90%;"> Yi Ren, Chenxu Hu, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, and Tie-Yan Liu, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib43.2.1" style="font-size:90%;">“FastSpeech 2: Fast and High-Quality End-to-End Text to Speech,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib43.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib43.4.2" style="font-size:90%;">9th International Conference on Learning Representations, ICLR 2021</span><span class="ltx_text" id="bib.bib43.5.3" style="font-size:90%;">, 2021. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib44"> <span class="ltx_tag ltx_tag_bibitem">[44]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib44.1.1" style="font-size:90%;"> Yang Zhang, Zhiqiang Lv, Haibin Wu, Shanshan Zhang, Pengfei Hu, Zhiyong Wu, Hung yi Lee, and Helen Meng, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib44.2.1" style="font-size:90%;">“MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib44.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib44.4.2" style="font-size:90%;">Proc. Interspeech 2022</span><span class="ltx_text" id="bib.bib44.5.3" style="font-size:90%;">, 2022, pp. 306–310. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib45"> <span class="ltx_tag ltx_tag_bibitem">[45]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib45.1.1" style="font-size:90%;"> Luca Franceschi, Paolo Frasconi, Saverio Salzo, Riccardo Grazzi, and Massimiliano Pontil, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib45.2.1" style="font-size:90%;">“Bilevel programming for hyperparameter optimization and meta-learning,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib45.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib45.4.2" style="font-size:90%;">International conference on machine learning</span><span class="ltx_text" id="bib.bib45.5.3" style="font-size:90%;">. PMLR, 2018, pp. 1568–1577. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib46"> <span class="ltx_tag ltx_tag_bibitem">[46]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib46.1.1" style="font-size:90%;"> Ankur Sinha, Pekka Malo, and Kalyanmoy Deb, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib46.2.1" style="font-size:90%;">“A review on bilevel optimization: From classical to evolutionary approaches and applications,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text ltx_font_italic" id="bib.bib46.3.1" style="font-size:90%;">IEEE Transactions on Evolutionary Computation</span><span class="ltx_text" id="bib.bib46.4.2" style="font-size:90%;">, vol. 22, no. 2, pp. 276–295, 2017. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib47"> <span class="ltx_tag ltx_tag_bibitem">[47]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib47.1.1" style="font-size:90%;"> David Snyder, Guoguo Chen, and Daniel Povey, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib47.2.1" style="font-size:90%;">“Musan: A music, speech, and noise corpus,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text ltx_font_italic" id="bib.bib47.3.1" style="font-size:90%;">arXiv preprint arXiv:1510.08484</span><span class="ltx_text" id="bib.bib47.4.2" style="font-size:90%;">, 2015. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib48"> <span class="ltx_tag ltx_tag_bibitem">[48]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib48.1.1" style="font-size:90%;"> Tom Ko, Vijayaditya Peddinti, Daniel Povey, Michael L Seltzer, and Sanjeev Khudanpur, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib48.2.1" style="font-size:90%;">“A study on data augmentation of reverberant speech for robust speech recognition,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib48.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib48.4.2" style="font-size:90%;">2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</span><span class="ltx_text" id="bib.bib48.5.3" style="font-size:90%;">. IEEE, 2017, pp. 5220–5224. </span> </span> </li> <li class="ltx_bibitem" id="bib.bib49"> <span class="ltx_tag ltx_tag_bibitem">[49]</span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib49.1.1" style="font-size:90%;"> Diederik P. Kingma and Jimmy Ba, </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib49.2.1" style="font-size:90%;">“Adam: A Method for Stochastic Optimization,” </span> </span> <span class="ltx_bibblock"><span class="ltx_text" id="bib.bib49.3.1" style="font-size:90%;">in </span><span class="ltx_text ltx_font_italic" id="bib.bib49.4.2" style="font-size:90%;">3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings</span><span class="ltx_text" id="bib.bib49.5.3" style="font-size:90%;">, 2015. </span> </span> </li> </ul> </section> </article> </div> <footer class="ltx_page_footer"> <div class="ltx_page_logo">Generated on Tue Sep 10 08:27:45 2024 by <a class="ltx_LaTeXML_logo" href="http://dlmf.nist.gov/LaTeXML/"><span style="letter-spacing:-0.2em; margin-right:0.1em;">L<span class="ltx_font_smallcaps" style="position:relative; bottom:2.2pt;">a</span>T<span class="ltx_font_smallcaps" style="font-size:120%;position:relative; bottom:-0.2ex;">e</span></span><span style="font-size:90%; position:relative; bottom:-0.2ex;">XML</span><img alt="Mascot Sammy" src="data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAsAAAAOCAYAAAD5YeaVAAAAAXNSR0IArs4c6QAAAAZiS0dEAP8A/wD/oL2nkwAAAAlwSFlzAAALEwAACxMBAJqcGAAAAAd0SU1FB9wKExQZLWTEaOUAAAAddEVYdENvbW1lbnQAQ3JlYXRlZCB3aXRoIFRoZSBHSU1Q72QlbgAAAdpJREFUKM9tkL+L2nAARz9fPZNCKFapUn8kyI0e4iRHSR1Kb8ng0lJw6FYHFwv2LwhOpcWxTjeUunYqOmqd6hEoRDhtDWdA8ApRYsSUCDHNt5ul13vz4w0vWCgUnnEc975arX6ORqN3VqtVZbfbTQC4uEHANM3jSqXymFI6yWazP2KxWAXAL9zCUa1Wy2tXVxheKA9YNoR8Pt+aTqe4FVVVvz05O6MBhqUIBGk8Hn8HAOVy+T+XLJfLS4ZhTiRJgqIoVBRFIoric47jPnmeB1mW/9rr9ZpSSn3Lsmir1fJZlqWlUonKsvwWwD8ymc/nXwVBeLjf7xEKhdBut9Hr9WgmkyGEkJwsy5eHG5vN5g0AKIoCAEgkEkin0wQAfN9/cXPdheu6P33fBwB4ngcAcByHJpPJl+fn54mD3Gg0NrquXxeLRQAAwzAYj8cwTZPwPH9/sVg8PXweDAauqqr2cDjEer1GJBLBZDJBs9mE4zjwfZ85lAGg2+06hmGgXq+j3+/DsixYlgVN03a9Xu8jgCNCyIegIAgx13Vfd7vdu+FweG8YRkjXdWy329+dTgeSJD3ieZ7RNO0VAXAPwDEAO5VKndi2fWrb9jWl9Esul6PZbDY9Go1OZ7PZ9z/lyuD3OozU2wAAAABJRU5ErkJggg=="/></a> </div></footer> </div> </body> </html>

Pages: 1 2 3 4 5 6 7 8 9 10