INDEX
Explanations
instances of speaking out or making public statements
instances of the phrase "speak out."
New Auto-Interp
Negative Logits
ynski
-0.75
rontal
-0.68
anos
-0.66
ppy
-0.65
liction
-0.65
xtap
-0.64
ugh
-0.64
igon
-0.63
avery
-0.60
Gadget
-0.60
POSITIVE LOGITS
stretched
0.91
loud
0.85
doors
0.74
lier
0.74
loudly
0.68
louder
0.66
landish
0.66
lander
0.66
mbuds
0.65
valves
0.64
Activations Density 0.023%