INDEX
Explanations
references to audience and societal groups
New Auto-Interp
Negative Logits
vek
-0.07
arto
-0.07
660
-0.07
unto
-0.07
ÑģобÑĸ
-0.06
lopen
-0.06
ici
-0.06
İ
-0.06
åΰ
-0.06
ICI
-0.06
POSITIVE LOGITS
about
0.14
about
0.12
tentang
0.11
دربارÙĩ
0.10
_about
0.10
regarding
0.10
åħ³äºİ
0.10
concerning
0.09
-about
0.09
About
0.09
Activations Density 0.060%