INDEX
Explanations
foreign articles meaning "a" or "one"
New Auto-Interp
Negative Logits
you
0.68
başta
0.61
kojima
0.57
because
0.57
and
0.55
didn
0.55
with
0.55
+
0.55
there
0.54
লাগছে
0.54
POSITIVE LOGITS
sebuah
1.08
ενός
0.96
Một
0.93
một
0.93
isang
0.91
една
0.90
seorang
0.90
một
0.87
ਇੱਕ
0.86
ఒక
0.85
Activations Density 0.000%