INDEX
Explanations
translations and foreign languages
NEGATIVE LOGITS
ennzeichnet  0.45
iotsitewise  0.41
correctement  0.41
سٹ  0.40
ferries  0.40
berhasil  0.39
السبب  0.38
Bulld  0.38
认真  0.37
ஏன்  0.37
POSITIVE LOGITS
for  0.61
для  0.61
για  0.57
für  0.57
для  0.55
in  0.52
across  0.49
в  0.49
براي  0.49
under  0.48
Activations Density 0.002%