Explanations
references to societal aspects
references to social concepts or societal issues
Negative Logits
Token         Logit
fluorescent   -0.72
PRES          -0.66
stump         -0.66
Grayson       -0.66
Garland       -0.63
knocking      -0.63
Rupert        -0.61
defective     -0.61
recall        -0.61
wounding      -0.60
Positive Logits
Token         Logit
ieties        1.39
ietal         1.37
iety          1.29
keye          1.28
iet           1.19
ionics        1.08
ivil          1.07
cer           1.02
iological     0.98
orp           0.92
Activations Density: 0.044%