INDEX
Explanations
and highlight quotes from various politicians
New Auto-Interp
Negative Logits
ulia
-0.60
ween
-0.58
ificial
-0.57
antics
-0.57
eways
-0.57
bees
-0.57
":""},{"-0.56
Kind
-0.55
adh
-0.55
Characters
-0.55
POSITIVE LOGITS
Jr
1.21
chairman
1.15
Democrat
1.10
chair
1.05
Ranking
0.94
Republican
0.92
Republican
0.91
Chairman
0.90
Chair
0.87
lawmaker
0.86
Activations Density 0.095%