INDEX
    Explanations

    words related to locations or entities

    specific phrases or terms related to emotional connections and personal experiences

    New Auto-Interp
    Negative Logits
    oval
    -1.01
    umen
    -1.01
    aris
    -0.99
    urer
    -0.96
    ctor
    -0.93
    ues
    -0.93
    oir
    -0.92
    ares
    -0.91
    inar
    -0.91
    eks
    -0.91
    POSITIVE LOGITS
     flush
    0.64
     bounce
    0.63
     retina
    0.63
     Hubble
    0.60
     wink
    0.60
     cliff
    0.59
     Colossus
    0.59
     Cinema
    0.58
     Broadcasting
    0.57
     Crush
    0.57
    Act Density 0.367%

    No Known Activations