INDEX
Explanations
words related to crises or critical situations
references to actions or conditions related to crisis and suffering
New Auto-Interp
Negative Logits
acqu
-0.73
recomm
-0.65
PROV
-0.63
dw
-0.63
rave
-0.59
âĢ¢âĢ¢
-0.57
Samar
-0.56
Renaissance
-0.56
raise
-0.55
adjusts
-0.55
POSITIVE LOGITS
xus
0.85
ļéĨĴ
0.75
atism
0.72
nces
0.72
clips
0.69
pled
0.68
illo
0.65
crew
0.65
essional
0.64
uled
0.63
Activations Density 0.055%