INDEX
Explanations
phrases describing actions or events happening from a specific location
phrases indicating the source or perspective of information
New Auto-Interp
Negative Logits
merce
-0.72
ratulations
-0.70
idas
-0.68
peat
-0.68
xual
-0.66
blems
-0.65
ivariate
-0.64
itivity
-0.64
gradation
-0.64
potion
-0.63
POSITIVE LOGITS
afar
1.55
inside
1.00
behind
0.98
abroad
0.95
upstairs
0.91
nowhere
0.88
whence
0.88
behind
0.87
atop
0.86
backstage
0.85
Activations Density 0.172%