INDEX
Explanations
references to significant decisions or actions being taken
instances of the word "move."
New Auto-Interp
Negative Logits
Condition
-0.74
omial
-0.72
IZE
-0.61
Curve
-0.59
iciency
-0.59
icum
-0.58
raid
-0.58
oola
-0.58
concess
-0.58
Koran
-0.57
POSITIVE LOGITS
toward
1.11
towards
1.10
backs
0.87
able
0.86
forward
0.83
away
0.82
over
0.81
overs
0.80
rers
0.79
iton
0.78
Activations Density 0.031%