INDEX
Explanations
instances of negativity or critical sentiment
New Auto-Interp
Negative Logits
Nicarag
-0.74
Arabian
-0.72
EntityItem
-0.63
Dimension
-0.62
Millennium
-0.60
Galile
-0.60
PLoS
-0.60
Heavenly
-0.60
sodium
-0.59
Territory
-0.58
POSITIVE LOGITS
outs
1.23
out
1.20
away
1.16
up
1.15
around
1.14
offs
1.13
through
1.13
back
1.11
ups
1.10
forward
1.07
Activations Density 0.030%