INDEX
Explanations
mentions of the word "De" or variations thereof
New Auto-Interp
Negative Logits
gens
-0.17
/xhtml
-0.15
ç´
-0.15
pell
-0.14
Ľ
-0.14
g
-0.14
ointments
-0.14
ваннÑı
-0.14
gio
-0.14
thers
-0.14
POSITIVE LOGITS
eper
0.24
aling
0.24
acon
0.23
ird
0.22
legates
0.21
arest
0.20
uces
0.20
puties
0.20
imos
0.20
uts
0.20
Activations Density 0.030%