INDEX
    Explanations

    information related to current events, news, and global politics

    New Auto-Interp
    Negative Logits
    bell
    -0.92
    sis
    -0.81
    hat
    -0.71
    sil
    -0.69
    aver
    -0.68
    itus
    -0.68
    ita
    -0.67
    jab
    -0.66
    doms
    -0.65
    athy
    -0.64
    POSITIVE LOGITS
     several
    0.97
     prominently
    0.95
     plenty
    0.93
     numerous
    0.93
     dozens
    0.89
     some
    0.86
     lots
    0.86
     multiple
    0.86
     everything
    0.85
     elements
    0.83
    Act Density 2.964%

    No Known Activations