INDEX
Explanations
phrases related to controversial issues or topics
Negative Logits
SPR         -0.51
Mamm        -0.51
Suzuki      -0.48
mount       -0.47
Boone       -0.47
Lunar       -0.46
vik         -0.46
Dinosaur    -0.46
Motor       -0.46
Spl         -0.45
Positive Logits
pursuant    0.64
namely      0.60
iatus       0.59
altogether  0.57
abroad      0.57
disag       0.55
anship      0.55
because     0.55
whatsoever  0.54
globally    0.54
Activations Density 8.126%