INDEX
Explanations
references to educational and academic institutions
New Auto-Interp
Negative Logits
dorf
-0.17
fen
-0.16
ater
-0.15
eny
-0.14
azo
-0.14
UREMENT
-0.14
net
-0.14
enance
-0.14
oppel
-0.14
udes
-0.14
POSITIVE LOGITS
oid
0.17
underst
0.16
678
0.15
æ£
0.15
ähr
0.14
OID
0.14
657
0.14
oids
0.14
Griffin
0.14
concent
0.13
Activations Density 0.051%