Explanations
issues related to violence, abuse, and societal injustices
NEGATIVE LOGITS
UTC         -0.71
MET         -0.70
Export      -0.70
Ry          -0.69
details     -0.67
Cal         -0.64
Shares      -0.63
Pitt        -0.62
Fish        -0.61
SCP         -0.61
POSITIVE LOGITS
trauma      0.89
syndrome    0.86
ukemia      0.78
Syndrome    0.77
agar        0.73
fame        0.73
illness     0.71
abroad      0.69
oppression  0.69
sickness    0.68
Activations Density: 0.198%