INDEX
    Explanations

    references to news sources or publishers

    possessive forms related to various entities or organizations

    New Auto-Interp
    Negative Logits
    #$#$
    -0.80
    $$$$
    -0.79
    ét
    -0.76
    PLA
    -0.74
    Ùĩ
    -0.74
    those
    -0.73
    \-
    -0.72
    ا
    -0.72
    ET
    -0.72
    },
    -0.70
    POSITIVE LOGITS
     newest
    1.04
     Kevin
    1.00
     Brian
    0.97
     Erik
    0.95
     Darren
    0.95
     Jeffrey
    0.94
     Ian
    0.94
     Josh
    0.94
     Geoff
    0.93
     chief
    0.93
    Act Density 0.131%

    No Known Activations