INDEX
Explanations
instances of unexpected or surprising events or revelations
instances of the word "twist" in various contexts
New Auto-Interp
Negative Logits
audi
-0.75
leased
-0.68
redistributed
-0.66
Hurricanes
-0.65
minent
-0.63
Defenders
-0.63
hammad
-0.62
arel
-0.60
RES
-0.60
usable
-0.60
POSITIVE LOGITS
twist
1.29
twists
1.16
Twist
0.91
endings
0.88
Whedon
0.83
stitch
0.83
weave
0.80
twisting
0.79
stroke
0.77
nir
0.77
Activations Density 0.009%