INDEX
Explanations
terms related to social and economic well-being
New Auto-Interp
Negative Logits
ANNER
-0.17
anner
-0.16
utsch
-0.16
ndl
-0.16
ýv
-0.15
ÄĽst
-0.15
ager
-0.15
edList
-0.14
Ù¾ÛĮر
-0.14
WithMany
-0.14
POSITIVE LOGITS
for
0.23
bagi
0.23
длÑı
0.23
dla
0.21
für
0.18
ç»Ļ
0.17
for
0.17
chez
0.16
对äºİ
0.16
728
0.16
Activations Density 0.301%