INDEX
    Explanations

    references to specific football clubs

    New Auto-Interp
    Negative Logits
    inski
    -0.20
    alogy
    -0.15
    ANNOT
    -0.15
    خاÙĨÙĩ
    -0.14
    ué
    -0.14
    raith
    -0.14
    tons
    -0.14
    quette
    -0.14
    iper
    -0.14
    394
    -0.14
    POSITIVE LOGITS
     Tw
    0.19
    CLA
    0.18
    omal
    0.16
     Barcelona
    0.16
    наÑĢ
    0.15
    iac
    0.15
    ersen
    0.15
    gee
    0.15
     twist
    0.15
     tw
    0.15
    Act Density 0.008%

    No Known Activations