INDEX
    Explanations

    proper nouns associated with leadership or prominent figures

    Negative Logits
     Wolver     -0.15
    odox        -0.15
     Blackburn  -0.14
    azon        -0.14
    ÄįÃŃ        -0.14
    acco        -0.14
    izza        -0.14
    uard        -0.14
    ió          -0.14
    gress       -0.14
    Positive Logits
    ·           0.15
    ĴĪ          0.14
    zÄħd        0.14
     Familie    0.14
    ød          0.14
     Andersen   0.14
    ibilidade   0.13
     capacit    0.13
    rawer       0.13
    ulet        0.13
    Activation Density: 0.363%

    No Known Activations