INDEX
Explanations
phrases indicating superlatives or rankings
New Auto-Interp
Negative Logits
pmwiki
-0.76
hyde
-0.73
bleacher
-0.65
uyomi
-0.59
Cosponsors
-0.58
yna
-0.57
Unsure
-0.57
BAT
-0.57
ouched
-0.56
gow
-0.56
POSITIVE LOGITS
theirs
0.95
2017
0.93
2016
0.90
2015
0.90
2013
0.88
2014
0.88
all
0.84
any
0.83
hers
0.82
2012
0.80
Activations Density 0.062%