    Explanations

    proper nouns, particularly names and locations related to sports figures and teams

    Negative Logits
    adera         -0.19
    ãĤ¹ãĤ¿ãĥ¼     -0.15
    posables      -0.15
    inders        -0.15
    èį            -0.15
    ÄĽr           -0.15
    amera         -0.14
    -NLS          -0.14
    aras          -0.14
    ëij¥          -0.14

    Positive Logits
     react         0.21
     gestures      0.20
     compet        0.19
    cele           0.18
     warming       0.17
     reacts        0.17
     celebrate     0.17
     looks         0.17
     gest          0.17
     during        0.17

    Act Density 0.018%

    No Known Activations