INDEX
Explanations
phrases related to political discussions and arguments
New Auto-Interp
Negative Logits
Contents
-0.78
coins
-0.76
words
-0.76
ESPN
-0.74
flies
-0.74
Statistics
-0.70
Instruct
-0.70
Examples
-0.69
marks
-0.69
encies
-0.68
POSITIVE LOGITS
revival
1.07
comprehensive
1.05
continuation
1.05
plethora
1.03
gradual
1.02
comeback
1.01
showdown
1.01
return
0.98
resurgence
0.98
boycott
0.98
Activations Density 0.344%