INDEX
    Explanations

    expressions of political discourse and accountability

    New Auto-Interp
    Negative Logits
    /
    -0.65
     (
    -0.57
     /
    -0.56
     vs
    -0.49
     i
    -0.48
     z
    -0.46
    感じで
    -0.46
    较为
    -0.46
    )/
    -0.46
    )
    -0.45
    POSITIVE LOGITS
     doubtnut
    1.09
     myſelf
    1.04
     صوتيه
    1.04
    ſelf
    1.03
     itſelf
    0.99
     purpoſe
    0.96
     faſt
    0.96
    ſelves
    0.93
    XmlAccessType
    0.92
     pleaſure
    0.92
    Act Density 0.285%

    No Known Activations