INDEX
Explanations
phrases that indicate a location or position
references to vague or unspecified locations or times
New Auto-Interp
Negative Logits
ombat
-0.79
shock
-0.77
ii
-0.76
iframe
-0.74
urated
-0.74
icer
-0.74
iger
-0.72
andel
-0.72
abilities
-0.70
ortex
-0.70
POSITIVE LOGITS
else
1.42
Else
1.13
Else
1.04
between
0.98
along
0.95
abouts
0.94
nearer
0.92
around
0.91
upstream
0.90
downstream
0.88
Activations Density 0.036%