INDEX
Explanations
occurrences of specific prepositions and conjunctions in various languages, indicating focus on locative or comparative contexts
occurrences of specific names and references to notable entities or concepts
New Auto-Interp
Negative Logits
EconPapers
-0.90
Bask
-0.78
unitOfWork
-0.77
Irvin
-0.76
переди
-0.74
Corm
-0.74
čierna
-0.74
GOB
-0.74
..\..\
-0.72
oulder
-0.71
POSITIVE LOGITS
на
1.13
Auf
1.02
auf
0.98
Auf
0.91
На
0.88
naf
0.88
Katsu
0.87
На
0.86
na
0.83
ENAME
0.80
Activations Density 0.172%