INDEX
Explanations
topics related to politics and governmental issues
New Auto-Interp
Negative Logits
Eleven
-0.82
CCC
-0.81
JD
-0.71
Amen
-0.66
Rating
-0.63
paragraph
-0.63
Daniels
-0.62
Posts
-0.61
Sao
-0.61
Chronicle
-0.61
POSITIVE LOGITS
're
1.44
selves
1.26
selves
1.11
've
1.05
'll
1.03
themselves
1.03
zbollah
1.00
are
0.91
mos
0.91
atically
0.89
Activations Density 2.964%