INDEX
Explanations
words related to surprises or unexpected events
mentions of the term "shock."
New Auto-Interp
Negative Logits
guyen
-0.86
amins
-0.84
uties
-0.76
subp
-0.75
©¶æ
-0.74
orney
-0.72
ccording
-0.70
igion
-0.67
allery
-0.66
Parenthood
-0.65
POSITIVE LOGITS
wave
1.06
waves
1.05
shock
1.04
absor
1.03
shock
0.92
shocks
0.89
Shock
0.86
Shock
0.85
imaru
0.81
ingly
0.81
Activations Density 0.009%