INDEX
    Explanations

    explaining corrections or specific items

    New Auto-Interp
    Negative Logits
    イト
    1.03
    Kate
    0.98
     Eaton
    0.97
     Kate
    0.94
     Deacon
    0.92
     kate
    0.87
    kate
    0.86
     সিলেটের
    0.85
     Cote
    0.85
     Damon
    0.84
    POSITIVE LOGITS
     Rust
    0.96
     Woodruff
    0.88
     Martínez
    0.86
    rust
    0.85
     Chest
    0.83
     Randolph
    0.83
     Fitzgerald
    0.83
     музи
    0.82
    Chest
    0.82
    Rust
    0.78
    Act Density 1.380%

    No Known Activations