INDEX
    Explanations

    proper nouns related to locations or people

    common prefixes and suffixes in words

    New Auto-Interp
    Negative Logits
    ĩ
    -0.69
    ī
    -0.69
    ·
    -0.68
     Citiz
    -0.67
    «
    -0.67
    orial
    -0.65
    İ
    -0.65
    ĺ
    -0.65
    OME
    -0.64
    lvl
    -0.64
    POSITIVE LOGITS
     infring
    0.56
     spokeswoman
    0.56
    èĢħ
    0.53
     accent
    0.52
    .—
    0.52
    .(
    0.52
     contam
    0.51
     spokesman
    0.51
     arrives
    0.50
    !,
    0.49
    Act Density 2.068%

    No Known Activations