Explanations
verbs related to communication and interaction
instances of reported alarming events or crises
Negative Logits
depends      -0.62
estern       -0.56
Currently    -0.54
quartered    -0.54
undrum       -0.53
ilial        -0.53
Allaah       -0.52
pires        -0.50
rentices     -0.50
icka         -0.50
Positive Logits
theirs       0.62
}.           0.62
scrut        0.58
panicked     0.57
their        0.56
]).          0.56
afterward    0.56
culminating  0.54
enthusi      0.54
unison       0.54
Activations Density: 1.635%