INDEX
    Explanations

    proper nouns and references to geographical locations or entities

    New Auto-Interp
    Negative Logits
     Blasio
    -0.16
    заб
    -0.16
     dán
    -0.15
    emotion
    -0.15
    ãĥ³ãĤ¯
    -0.14
    ¯u
    -0.14
    ugin
    -0.14
    ases
    -0.14
     rodi
    -0.13
    Dock
    -0.13
    POSITIVE LOGITS
    urette
    0.19
    esis
    0.15
     Dear
    0.15
    ered
    0.15
     morph
    0.14
    warf
    0.14
    510
    0.14
     Rx
    0.14
    oire
    0.14
    e
    0.13
    Act Density 0.085%

    No Known Activations