    Explanations

    proper nouns or names, particularly related to people

    repeated names or references to specific individuals in the text

    Negative Logits
    Token        Logit
    " Aad"       -0.71
    " Bie"       -0.69
    " Maya"      -0.68
    "thing"      -0.68
    "FACE"       -0.68
    "••"         -0.66
    " Afgh"      -0.65
    " answ"      -0.64
    " bubble"    -0.64
    " stink"     -0.63

    Positive Logits
    Token          Logit
    " Berger"      1.99
    " Macron"      1.88
    "alle"         1.76
    " Robertson"   1.34
    " Rou"         1.24
    " LC"          1.21
    "olin"         1.21
    "oton"         1.20
    " Rogue"       1.09
    "rift"         1.08

    Activation Density: 0.034%

    No Known Activations