Explanations
mentions of euthanizing or related terms
references to euthanasia
Negative Logits
Token       Logit
Dynamics    -0.73
Belt        -0.72
اÙĦ         -0.71
Soda        -0.70
Painter     -0.69
Dress       -0.66
Pell        -0.65
Brand       -0.65
Collider    -0.65
Shirt       -0.64
Positive Logits
Token       Logit
anasia      1.56
euth        0.95
onym        0.91
umatic      0.84
rieved      0.83
oenix       0.83
umbn        0.82
isec        0.80
onomic      0.79
ADRA        0.77
Activations Density 0.021%
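
To give the density figure a concrete scale: 0.021% corresponds to about 21 active tokens per 100,000. The sketch below assumes that "activation density" means the share of sampled tokens on which the feature's activation is above zero; the dashboard does not state its exact definition, and the function name, threshold, and sample data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of activations strictly above `threshold`.

    Assumption: density is the share of tokens on which the feature fires.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: the feature fires on 21 of 100,000 tokens.
sample = [1.0] * 21 + [0.0] * 99_979
density = activation_density(sample)
print(f"{density:.3f}%")  # prints "0.021%"
```

A feature this sparse is consistent with the narrow explanations listed above, although the density alone says nothing about which tokens trigger it.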