INDEX
    Explanations

    references to political figures and events

    Negative Logits
     Azerb
    -0.04
    Þ
    -0.04
    elsius
    -0.04
     guiActiveUn
    -0.04
    oÄŁ
    -0.03
    £ı
    -0.03
    ÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤ
    -0.03
    ñ
    -0.03
    ij士
    -0.03
     Vaugh
    -0.03
    Positive Logits
    0.05
    The
    0.05
    -
    0.05
    .
    0.04
    ,
    0.04
     the
    0.04
     and
    0.04
    A
    0.04
    I
    0.04
    B
    0.04
    Activation Density 2.839%

    No Known Activations