INDEX
Explanations
words related to debates
discussions and mentions of debates on various topics
Negative Logits
Token      Logit
printed    -0.72
ever       -0.71
alty       -0.70
eared      -0.68
jer        -0.66
veh        -0.65
uilt       -0.65
liner      -0.64
resp       -0.64
zer        -0.64
Positive Logits
Token        Logit
debate       1.15
debates      1.08
Debate       0.97
debating     0.92
debated      0.85
halla        0.83
moderators   0.78
forum        0.73
discussion   0.71
controversy  0.70
Activations Density: 0.014%