Explanations
expressions of emotions and subjective opinions
Negative Logits
ěž               -0.14
achat            -0.14
alez             -0.13
κÏĮ              -0.13
raid             -0.12
lopen            -0.12
illet            -0.12
nelle            -0.11
responseObject   -0.11
ůž               -0.11
Positive Logits
about            1.40
about            1.15
About            1.02
About            1.00
ABOUT            0.98
_about           0.97
tentang          0.90
关于             0.89
-about           0.88
.about           0.85
Activations Density: 1.756%