INDEX
Explanations
victims of various situations or issues
references to victims of various crimes and injustices
New Auto-Interp
Negative Logits
oday
-0.82
MX
-0.76
yip
-0.74
soDeliveryDate
-0.74
ulum
-0.74
readable
-0.70
ortment
-0.69
ary
-0.69
heit
-0.68
rier
-0.68
POSITIVE LOGITS
ingest
0.80
gou
0.71
metic
0.69
psych
0.68
suff
0.66
untreated
0.66
displacement
0.64
starvation
0.63
Ö¼
0.63
either
0.63
Activations Density 0.083%