Explanations
linguistic markers indicating species or biological classifications
Negative Logits
alim      -0.08
owe       -0.06
aday      -0.06
upro      -0.06
Presence  -0.06
oči       -0.06
bsp       -0.06
andel     -0.06
ofile     -0.06
obao      -0.06
Positive Logits
ities     0.08
ity       0.07
eter      0.07
ital      0.07
394       0.06
arden     0.06
Ard       0.06
เคร       0.06
conc      0.06
Canter    0.06
Activations Density 0.001%