INDEX
    Explanations

    proper nouns, specifically names of people and locations

    proper nouns, specifically names of individuals

    New Auto-Interp
    Negative Logits
    nces
    -1.02
    fights
    -0.73
    amaz
    -0.70
    mate
    -0.70
    ================
    -0.70
    ndra
    -0.69
    Ranked
    -0.69
    Ü
    -0.68
    mine
    -0.66
     mathemat
    -0.66
    POSITIVE LOGITS
    ZI
    0.90
    Neill
    0.78
     Lennon
    0.76
    elson
    0.75
     Wick
    0.75
    steen
    0.73
    asonic
    0.73
    oyd
    0.73
    otiation
    0.72
     Advertisement
    0.72
    Act Density 0.015%

    No Known Activations