Explanations
intensifiers that convey strong emotions or opinions
Negative Logits
antro     -0.17
ãģĦãĤĭ    -0.15
izzo      -0.14
ÙĪØ²      -0.14
èŃ·       -0.14
;base     -0.14
Ñħод      -0.14
à¥Īसल     -0.14
hea       -0.14
изнеÑģ    -0.14
Positive Logits
ething    0.16
ĶåĽŀ      0.15
quier     0.14
ανά       0.13
ارة       0.13
IDEO      0.13
ienes     0.13
reau      0.13
á»įng     0.13
YRO       0.13
Activations Density 0.015%
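
Several token strings in the logit lists (for example ãģĦãĤĭ or изнеÑģ) look like byte-level BPE surface forms, where each raw byte is shown as a printable stand-in character. The source does not name the tokenizer, so the following is only a sketch that assumes the GPT-2 style byte-to-character table; the helper name decode_token is hypothetical and not part of any dashboard API. Characters that are not in the table are passed through unchanged, which covers the partially decoded entries.

```python
def bytes_to_unicode():
    """Build the GPT-2 style byte -> printable character table (assumed here)."""
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    for b in range(256):
        if b not in bs:
            # Non-printable bytes are shifted to code points starting at 256.
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


# Reverse table: stand-in character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Hypothetical helper: map a displayed token string back to readable text."""
    raw = bytearray()
    for ch in token:
        if ch in UNICODE_TO_BYTE:
            raw.append(UNICODE_TO_BYTE[ch])
        else:
            # Already-readable characters (assumption: partially decoded
            # display) are kept by re-encoding them as UTF-8.
            raw.extend(ch.encode("utf-8"))
    # Tokens that start or end mid-character yield U+FFFD placeholders.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãģĦãĤĭ", "изнеÑģ", "Ñħод", "izzo"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, ãģĦãĤĭ decodes to いる and изнеÑģ to изнес, while plain ASCII tokens such as izzo come back unchanged. A token like ĶåĽŀ begins partway through a multi-byte character, so its first byte cannot be decoded on its own and shows up as a replacement character.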