INDEX
Explanations
references to historical or notable landmarks
New Auto-Interp
Negative Logits
æĬĺ
-0.16
acro
-0.15
ialis
-0.15
recycl
-0.15
enburg
-0.14
wright
-0.14
oppable
-0.14
oday
-0.14
vant
-0.14
vern
-0.13
POSITIVE LOGITS
itten
0.15
agini
0.14
iska
0.14
æĢģ
0.14
æħĭ
0.14
NL
0.14
ementia
0.14
consequence
0.14
deo
0.14
gala
0.14
Activations Density 0.002%