INDEX
Explanations
Politically related terms or mentions, specifically those concerning impeachment and political figures
Negative Logits
BLE            -0.74
enegger        -0.73
ecause         -0.70
EEE            -0.66
Dragonbound    -0.66
nces           -0.65
ãģ¦            -0.65
manship        -0.64
ãĤĭ            -0.63
loo            -0.63
Positive Logits
itude          1.38
ogether        1.36
itudes         1.34
itud           1.33
itudinal       1.17
imore          1.10
uve            0.97
imeter         0.97
imately        0.84
imeters        0.83
Activations Density: 0.016%