INDEX
Explanations
mentions of geographical locations
the preposition "in."
New Auto-Interp
Negative Logits
NOW
-0.73
CLASSIFIED
-0.68
Features
-0.67
cues
-0.66
ESA
-0.64
hooks
-0.63
TIME
-0.63
ingred
-0.63
ATURES
-0.62
selves
-0.62
POSITIVE LOGITS
conjunction
1.08
lieu
1.05
clusions
1.04
accordance
1.02
efficiency
1.00
vain
0.99
effic
0.98
roads
0.95
patient
0.94
relation
0.93
Activations Density 0.248%