INDEX
Explanations
statements about individuals' identities, accomplishments, and historical significance
Negative Logits
earn        -0.15
очередь     -0.15
rych        -0.14
zar         -0.14
.uni        -0.14
tutar       -0.14
łí          -0.13
Previously  -0.13
مش          -0.13
SYN         -0.13
Positive Logits
вина        0.15
records     0.14
Forever     0.14
기록        0.14
kers        0.14
laid        0.14
ertia       0.14
ทย          0.14
record      0.14
today       0.14
Activations Density 0.228%