INDEX
Explanations
expressions or mentions of the Russian language or culture
Russian language identifiers
New Auto-Interp
Negative Logits
رشف
-0.61
LookAnd
-0.48
Unterscheidung
-0.47
kyse
-0.46
acompañado
-0.45
følgelig
-0.44
acompañada
-0.44
Belakang
-0.43
Література
-0.42
signos
-0.41
POSITIVE LOGITS
Ru
0.60
RU
0.59
Ru
0.57
RU
0.54
ru
0.54
ru
0.54
Rus
0.52
Sule
0.49
useParams
0.48
sarcoma
0.48
Activations Density 0.000%