INDEX
Explanations
mentions of events or incidents involving death or serious injury
phrases related to events that occur after a specified incident
New Auto-Interp
Negative Logits
isa
-0.82
aez
-0.79
女
-0.74
available
-0.74
ertain
-0.73
OE
-0.73
ãĥ¼
-0.72
Else
-0.72
CHR
-0.72
ophobia
-0.72
POSITIVE LOGITS
ingest
0.99
consuming
0.92
sustaining
0.92
receiving
0.88
undergoing
0.88
encountering
0.88
completing
0.87
witnessing
0.87
collapsing
0.87
exchanging
0.86
Activations Density 0.109%