INDEX
Explanations
phrases related to events resulting in harm or danger to individuals
references to "casual" incidents or events
New Auto-Interp
Negative Logits
Mercury
-0.68
tyrann
-0.67
unm
-0.66
Centauri
-0.64
fluorescent
-0.63
prescribing
-0.62
questioning
-0.62
irrad
-0.59
fixation
-0.59
deforestation
-0.59
POSITIVE LOGITS
inelli
1.20
imir
1.10
arella
1.09
ual
1.08
cas
1.05
itaire
1.03
arro
1.03
Royale
1.02
ierra
0.97
anova
0.96
Activations Density 0.012%