INDEX
Explanations
details about people and their actions
references to victims and their backgrounds
Negative Logits
eur         -0.57
regulation  -0.57
prem        -0.57
ifice       -0.56
banning     -0.55
Insert      -0.55
Limit       -0.54
Built       -0.54
forcement   -0.54
PRESS       -0.54
Positive Logits
selves        0.96
disembark     0.94
individually  0.89
congreg       0.89
evacuated     0.85
numbered      0.84
surn          0.81
themselves    0.80
dispersed     0.79
volunte       0.79
Activations Density: 0.718%