INDEX
Explanations
issues related to social problems and systemic inequalities
New Auto-Interp
Negative Logits
ullan
-0.17
üven
-0.16
èĦ
-0.15
äºī
-0.15
hk
-0.15
Essential
-0.14
tainted
-0.14
شاÙĩد
-0.14
utsch
-0.14
nearest
-0.14
POSITIVE LOGITS
serious
0.23
acute
0.23
magn
0.23
worse
0.22
grave
0.21
experienced
0.20
present
0.20
severe
0.20
accent
0.20
wors
0.20
Activations Density 0.270%