INDEX
Explanations
significant themes or reminders within text related to health and societal issues
Negative Logits
Token           Logit
ynet            -0.17
lopedia         -0.17
ÑģÑĤÑĢ          -0.16
holm            -0.15
olen            -0.15
iven            -0.14
zeich           -0.14
ÙIJÙĨ            -0.14
à¤Ĥà¤Ł          -0.14
ãĥ¡ãĥ³ãĥĪ       -0.14
Positive Logits
Token           Logit
isci            0.18
arella          0.15
587             0.15
cone            0.15
ırak            0.14
olle            0.14
ży              0.14
CI              0.14
cone            0.14
hud             0.14
Activations Density 0.411%