Explanations
present-tense markers indicating ongoing actions or changes
Negative Logits
vician          -0.56
出版年          -0.55
tuttavia        -0.54
mellett         -0.54
ursprünglich    -0.53
χρι             -0.53
nigdy           -0.52
sempat          -0.51
connexes        -0.51
хватает         -0.51
Positive Logits
now             1.17
Теперь          1.15
Теперь          1.13
ahora           0.96
Ahora           0.95
теперь          0.94
Now             0.94
NOW             0.92
Now             0.91
now             0.91
Activations Density: 0.359%