INDEX
Explanations
Russian Cyrillic characters or words
Cyrillic characters or specific letters in Cyrillic scripts
New Auto-Interp
Negative Logits
gdala
-0.79
reads
-0.78
giving
-0.72
Joy
-0.69
menstrual
-0.69
berto
-0.68
iences
-0.65
wcs
-0.64
aith
-0.63
cause
-0.63
POSITIVE LOGITS
оÐ
1.27
о
1.17
а
1.15
и
1.13
л
1.13
ÑĤ
1.09
Ñģ
1.06
ÑĢ
1.05
Ñĥ
1.05
к
1.02
Activations Density 0.008%