INDEX
    Explanations

    references to news articles or reporters

    possessive phrases indicating attribution to various sources or authors

    New Auto-Interp
    Negative Logits
    PLA
    -0.82
    Sov
    -0.81
    #$#$
    -0.78
    unin
    -0.78
    ét
    -0.77
    $$$$
    -0.76
    oves
    -0.74
    utory
    -0.71
    RFC
    -0.71
    sov
    -0.71
    POSITIVE LOGITS
     Geoff
    1.25
     Jonathan
    1.22
     Jeffrey
    1.22
     Jason
    1.21
     Jennifer
    1.19
     Andrew
    1.19
     Matthew
    1.19
     Ian
    1.19
     Erik
    1.18
     Jesse
    1.18
    Act Density 0.098%

    No Known Activations