INDEX
Explanations
negative sentiment or criticism towards various topics
New Auto-Interp
Negative Logits
moder
-0.72
thumbs
-0.68
interf
-0.67
odan
-0.66
tolerance
-0.65
scanner
-0.62
antiv
-0.61
scales
-0.61
thumb
-0.60
numerical
-0.60
POSITIVE LOGITS
2016
1.16
2017
1.15
2014
1.10
2018
1.09
2020
1.07
2011
1.06
2012
1.06
2015
1.04
2013
1.03
2010
1.00
Activations Density 0.016%