INDEX
Explanations
mentions of healthcare and social issues
New Auto-Interp
Negative Logits
ird
-0.17
á»§
-0.16
Wind
-0.15
elite
-0.14
fault
-0.14
авиÑģ
-0.13
¬ģ
-0.13
çŁ¢
-0.13
Animator
-0.13
ht
-0.13
POSITIVE LOGITS
för
0.17
Wikispecies
0.17
tax
0.16
ispecies
0.16
.tax
0.15
_tax
0.15
_NC
0.15
amient
0.15
miêu
0.14
PIE
0.14
Activations Density 0.015%