INDEX
    Explanations

    references to authority figures and their roles or statements

    Comes after a name or title

    New Auto-Interp
    Negative Logits
    du
    -0.51
    PR
    -0.46
     z
    -0.46
    SC
    -0.45
     Hotspur
    -0.45
     PR
    -0.44
    java
    -0.44
    .
    -0.43
     ব
    -0.43
    pae
    -0.43
    POSITIVE LOGITS
     Monfieur
    1.09
    ########.
    1.03
     متعلقه
    1.02
     auffi
    1.02
    TagMode
    1.00
     iſt
    0.96
     Efq
    0.95
    ſelf
    0.95
    parsedMessage
    0.95
    #+#
    0.94
    Act Density 0.714%

    No Known Activations