INDEX
Explanations
instances of specific locations or directions in a driving context
prepositions indicating location or spatial relationships
New Auto-Interp
Negative Logits
TPP
-0.83
INESS
-0.75
AMA
-0.71
rament
-0.69
RL
-0.67
APP
-0.67
Except
-0.66
Protect
-0.66
Parser
-0.65
Mini
-0.65
POSITIVE LOGITS
behalf
1.40
etime
1.21
yx
1.06
shore
1.05
coming
1.05
occasion
1.01
patrol
0.93
rooft
0.93
site
0.91
arrival
0.90
Activations Density 0.265%