INDEX
Negative Logits
brim
-0.67
thrill
-0.65
ado
-0.62
roar
-0.62
prey
-0.62
Tuls
-0.62
exagger
-0.61
concede
-0.61
rooting
-0.60
bribes
-0.60
POSITIVE LOGITS
2015
1.26
2017
1.24
2016
1.19
2014
1.19
2012
1.16
2013
1.16
2001
1.15
2018
1.15
2006
1.13
2008
1.11
Activations Density 0.043%