INDEX
Explanations
unexpected or surprising events or developments
phrases indicating unexpected changes or developments
Negative Logits
Token      Logit
ufact      -0.71
league     -0.69
RES        -0.68
leased     -0.66
particip   -0.65
Ducks      -0.65
audi       -0.64
hammad     -0.63
Lama       -0.62
alty       -0.62
Positive Logits
Token      Logit
twist      1.24
twists     1.18
Twist      0.89
endings    0.85
twisting   0.83
Whedon     0.78
weave      0.76
stroke     0.73
abouts     0.71
creen      0.71
Activations Density: 0.013%