INDEX
    Explanations

    references to criminal activity or investigations related to political figures

    New Auto-Interp
    Negative Logits
     CreateTagHelper
    -1.35
     doubtnut
    -1.31
     purpoſe
    -1.25
     Efq
    -1.23
     Majefty
    -1.23
     Shakspeare
    -1.21
     myſelf
    -1.21
     Jefus
    -1.20
     Anſ
    -1.17
     ་་
    -1.17
    POSITIVE LOGITS
    0.66
    ,
    0.64
     (
    0.61
    <eos>
    0.61
     in
    0.58
    :
    0.57
     N
    0.56
     A
    0.56
     να
    0.54
    .
    0.54
    Act Density 0.152%

    No Known Activations