INDEX
Explanations
phrases related to political discussions and debates, specifically mentioning both sides of an argument or issue
phrases indicating a division between opposing sides in a debate or discussion
New Auto-Interp
Negative Logits
orable
-0.70
chy
-0.66
nces
-0.65
IAN
-0.64
vity
-0.63
uala
-0.62
ndra
-0.61
itious
-0.61
rian
-0.59
jew
-0.59
POSITIVE LOGITS
aisle
0.93
spectrum
0.92
equation
0.74
Atlantic
0.72
ledger
0.71
Atlantic
0.70
divide
0.69
ticket
0.69
fence
0.68
strate
0.68
Activations Density 0.089%