INDEX
    Explanations

    mentions of sports teams

    Negative Logits
    " ReturnType"   -0.15
    "inas"          -0.14
    "arta"          -0.14
    " jich"         -0.14
    "czy"           -0.14
    " domest"       -0.14
    "elian"         -0.14
    "metro"         -0.14
    "inish"         -0.13
    "evin"          -0.13
    Positive Logits
    "anik"          0.17
    "_DIRS"         0.15
    "avel"          0.15
    "ÙĨدÛĮ"         0.14
    "_Private"      0.14
    "æĿIJ"          0.14
    "íģ¬ê¸°"        0.14
    "æ±Ĺ"           0.14
    "öyle"          0.14
    "ASET"          0.13
    Activation Density: 0.050%

    No Known Activations