    Negative Logits
        " Hardy"      -0.52
        " Dt"         -0.48
        " Martin"     -0.48
        " gew"        -0.44
        " Clifton"    -0.44
        " Gri"        -0.43
        " Rig"        -0.43
        " Gru"        -0.43
        " DT"         -0.43
        " William"    -0.43
    Positive Logits
        " soccer"     2.14
        " Soccer"     2.08
        "Soccer"      1.97
        "soccer"      1.95
        "CCER"        1.72
        "サッカー"    1.01
        " Fußball"    0.98
        " fútbol"     0.86
        (missing)     0.83
        " Fútbol"     0.82
    Activation Density: 0.004%

    No Known Activations