INDEX
Explanations
words related to evaluation and assessment outcomes
New Auto-Interp
Negative Logits
Bye
-0.16
InSection
-0.13
celik
-0.13
dostan
-0.13
uba
-0.13
ogr
-0.12
bÃŃ
-0.12
spy
-0.12
ãģ¤ãģij
-0.12
alars
-0.12
POSITIVE LOGITS
by
0.88
oleh
0.73
تÙĪØ³Ø·
0.66
bợi
0.60
by
0.46
بÙĪØ§Ø³Ø·Ø©
0.44
tarafından
0.44
_by
0.41
ìĿĺíķ´
0.40
przez
0.39
Activations Density 0.638%