INDEX
Explanations
Slavic languages and related terms
references to specific words or phrases in a language, potentially indicating a linguistic focus or context
New Auto-Interp
Negative Logits
ayer
-0.83
arton
-0.77
additive
-0.71
¿½
-0.70
onsense
-0.66
cause
-0.66
reads
-0.65
buquerque
-0.64
ACTED
-0.64
guyen
-0.62
POSITIVE LOGITS
оÐ
1.19
о
1.07
ÑĢ
1.07
л
1.03
а
1.00
Ñĥ
0.98
и
0.97
в
0.85
ÑĤ
0.82
eers
0.80
Activations Density 0.022%