INDEX
    Explanations

    spoken and written words ending in '-st', '-ent', '-rant', or '-ist'

    terms related to naming and identification

    New Auto-Interp
    Negative Logits
     loopholes
    -0.62
    FORMATION
    -0.60
     envy
    -0.59
    ÙĴ
    -0.59
     Duterte
    -0.59
     edit
    -0.59
     coli
    -0.58
    ModLoader
    -0.58
    ":[
    -0.57
     LEVEL
    -0.57
    POSITIVE LOGITS
    gaard
    0.94
    ensen
    0.92
    baugh
    0.91
    opoulos
    0.90
    rup
    0.88
    eer
    0.88
    feld
    0.87
    cia
    0.87
    cki
    0.87
    enburg
    0.86
    Act Density 0.191%

    No Known Activations