INDEX
Explanations
negative sentiments or expressions
New Auto-Interp
Negative Logits
Confederation
-0.74
Howe
-0.66
ĨĴ
-0.64
Borders
-0.64
Cause
-0.63
HCR
-0.62
existence
-0.61
constitu
-0.60
congr
-0.60
Province
-0.60
POSITIVE LOGITS
down
1.17
happy
1.13
downs
1.09
out
1.07
dash
1.06
outs
1.05
starting
1.05
back
1.05
hitting
1.05
backs
1.02
Activations Density 0.035%