INDEX
Explanations
dates and historical references
New Auto-Interp
Negative Logits
201
-0.16
newcom
-0.14
ofilm
-0.14
Cummings
-0.14
ERN
-0.14
iek
-0.14
agnar
-0.14
Vo
-0.14
hr
-0.14
imax
-0.13
POSITIVE LOGITS
ifestyles
0.19
ependency
0.15
Insensitive
0.15
Riot
0.15
à¤ĸ
0.14
ÙĨÛĮ
0.14
incer
0.14
esc
0.14
Projectile
0.13
adir
0.13
Activations Density 0.086%