INDEX
    Explanations

    discussions surrounding political commentary and personal opinions on various topics

    New Auto-Interp
    Negative Logits
    iset
    -0.17
     wherever
    -0.15
     respectively
    -0.14
    raph
    -0.14
     Alternative
    -0.14
    lesen
    -0.14
    abcdefghijklmnop
    -0.14
    åĿ¡
    -0.14
     enough
    -0.14
    ÑģиÑĤ
    -0.14
    POSITIVE LOGITS
     Eden
    0.16
    маÑħ
    0.16
    _DECLARE
    0.15
     anymore
    0.15
    unless
    0.15
     Unless
    0.14
    以å¤ĸ
    0.14
    ething
    0.14
    ergy
    0.14
    ogi
    0.14
    Act Density 0.260%

    No Known Activations