INDEX
    Explanations

    references to specific sports teams

    New Auto-Interp
    Negative Logits
    igne
    -0.16
    -in
    -0.15
    _INLINE
    -0.15
     World
    -0.15
    ppe
    -0.14
    ett
    -0.14
    dera
    -0.14
     staff
    -0.14
    corn
    -0.14
    pect
    -0.13
    POSITIVE LOGITS
    ÙħÛĮر
    0.15
    avel
    0.14
    mant
    0.14
    NCY
    0.14
    Italic
    0.14
    /Dk
    0.14
    gra
    0.14
    tv
    0.14
    ÙĨدÛĮ
    0.14
    avl
    0.14
    Act Density 0.044%

    No Known Activations