INDEX
    Explanations

    words related to specific names or terms

    proper names of people, particularly those involved in sports or entertainment

    New Auto-Interp
    Negative Logits
    ovych
    -0.88
    dfx
    -0.78
    CLASS
    -0.70
    #$
    -0.64
     gobl
    -0.62
    POSE
    -0.61
    ¯
    -0.60
     reckoning
    -0.59
    PASS
    -0.59
    oppable
    -0.59
    POSITIVE LOGITS
    xus
    0.89
    igham
    0.86
    eros
    0.80
    cius
    0.78
    oglu
    0.77
    antes
    0.76
    ortium
    0.73
    abre
    0.70
    ople
    0.70
    loe
    0.70
    Act Density 0.332%

    No Known Activations