INDEX
Explanations
information related to current events, news, and global politics
New Auto-Interp
Negative Logits
bell
-0.92
sis
-0.81
hat
-0.71
sil
-0.69
aver
-0.68
itus
-0.68
ita
-0.67
jab
-0.66
doms
-0.65
athy
-0.64
POSITIVE LOGITS
several
0.97
prominently
0.95
plenty
0.93
numerous
0.93
dozens
0.89
some
0.86
lots
0.86
multiple
0.86
everything
0.85
elements
0.83
Activations Density 2.964%