INDEX
Explanations
references to people appearing in specific outfits or settings
occurrences of the preposition "in"
Negative Logits
incumb      -0.94
76561       -0.70
learners    -0.68
convol      -0.67
employers   -0.67
llor        -0.64
ens         -0.62
rw          -0.61
%%          -0.60
lodged      -0.60
Positive Logits
lieu            1.09
animate         1.06
spite           1.06
appropriately   1.05
relation        1.03
strument        1.03
clus            1.02
situ            1.02
conjunction     1.01
effect          1.01
Activations Density: 0.365%