Explanations
words related to crises or serious issues
language related to crises and their impacts on society
Negative Logits
representations  -0.74
endment          -0.73
conversions      -0.68
dab              -0.64
promotional      -0.64
urai             -0.64
nods             -0.63
Creed            -0.63
accur            -0.62
nic              -0.62
Positive Logits
worsened         1.36
worsen           1.27
engulf           1.19
exacerbated      1.18
worsening        1.13
plag             1.08
aggravated       1.03
epidemic         1.01
compounded       1.00
severity         0.99
Activations Density: 0.478%