INDEX
Explanations
phrases related to negative impacts or setbacks
references to negative impacts or setbacks
New Auto-Interp
Negative Logits
iosity
-0.75
âĸ¬
-0.70
ordan
-0.68
Govern
-0.67
nesota
-0.65
Purpose
-0.65
natureconservancy
-0.64
ript
-0.63
Suc
-0.63
uana
-0.63
POSITIVE LOGITS
hole
1.01
blow
0.94
guns
0.93
gun
0.92
hard
0.89
pipe
0.88
blow
0.87
outs
0.87
holes
0.87
inflicted
0.86
Activations Density 0.014%