INDEX
Explanations
mentions of human figures and their interactions in a narrative context
names and roles of individuals involved in an incident
New Auto-Interp
Negative Logits
fulfillment
-0.69
advertis
-0.69
Quantity
-0.66
hibition
-0.66
fulfil
-0.65
btn
-0.65
hibit
-0.63
tumblr
-0.63
problem
-0.62
funding
-0.61
POSITIVE LOGITS
alerted
1.18
yelled
1.14
rushed
1.13
witnessed
1.12
chased
1.11
evacuated
1.07
ejected
1.06
awoke
1.03
heard
1.03
spotted
1.01
Activations Density 0.263%