INDEX
Explanations
phrases related to comparison or differentiation among multiple subjects
Negative Logits
setempat    -0.55
للاسماء    -0.52
SOUNDBITE    -0.50
Token    -0.49
Scénario    -0.48
token    -0.48
"\    -0.47
kanı    -0.47
ROOT    -0.47
ză    -0.46
Positive Logits
+#+#    0.89
expandindo    0.89
مشين    0.84
übrigen    0.82
BoxFit    0.81
ⓧ    0.81
himo    0.80
resten    0.80
utafitiHapana    0.80
Autres    0.78
Activations Density: 0.294%