INDEX
    Explanations

    sentences presenting strong opinions or emphasizing a point

    expressions of opinion or assertion

    New Auto-Interp
    Negative Logits
    nton
    -0.59
    obbies
    -0.59
    culosis
    -0.57
    elight
    -0.56
    oqu
    -0.56
    redited
    -0.56
     Pict
    -0.55
    sequently
    -0.55
    egu
    -0.55
    ospital
    -0.54
    POSITIVE LOGITS
     fucking
    0.71
     goddamn
    0.63
     bullshit
    0.62
    fuck
    0.61
     fucked
    0.60
     Canaver
    0.60
     Voldemort
    0.60
    _>
    0.60
     fuck
    0.59
     physicists
    0.59
    Act Density 1.721%

    No Known Activations