INDEX
    Explanations

    proper nouns related to individuals or entities

    proper nouns, primarily names and locations

    Negative Logits
    Token                   Logit
    " GOODMAN"              -0.78
    " conserv"              -0.69
    "accompan"              -0.67
    "PDATE"                 -0.67
    "â̦â̦â̦â̦"                  -0.66
    " DIRECT"               -0.65
    "VERS"                  -0.65
    "QUEST"                 -0.64
    "NRS"                   -0.63
    "³³³³³³³³³³³³³³³³"      -0.62
    Positive Logits
    Token                   Logit
    "a"                     1.20
    "al"                    1.18
    "o"                     1.15
    "e"                     1.13
    "i"                     1.03
    "y"                     1.02
    "ei"                    1.02
    "ic"                    0.98
    "ar"                    0.97
    "u"                     0.96
    Act Density 0.248%
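
The density figure above is commonly read as the share of sampled tokens on which the feature has a nonzero activation. The sketch below works under that assumption. The function name, the threshold parameter, and the sample counts are hypothetical and are not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the fraction of sampled tokens on which
    the feature fires, expressed as a percentage.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: the feature fires on 248 of 100,000 tokens.
sample = [0.0] * 99_752 + [1.5] * 248
density = activation_density(sample)
print(f"Act Density {density:.3f}%")
```

Under this reading, a density of 0.248% would mean the feature fires on roughly 1 token in 400.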

    No Known Activations