INDEX
Explanations
phrases that begin with "In" followed by a year
references to the structure of written works
Negative Logits
endors       -0.70
lodged       -0.68
behav        -0.67
nailed       -0.66
outwe        -0.65
Rohing       -0.64
combatants   -0.64
unemploy     -0.64
distur       -0.62
revoked      -0.62
Positive Logits
ventory      1.45
cluding      1.41
strument     1.33
cluded       1.26
clude        1.26
visible      1.26
vasive       1.22
juries       1.22
sect         1.22
iti          1.22
Activations Density: 0.134%