INDEX
Explanations
instances where phrases mention groups of people or time periods
phrases indicating common experiences or sentiments shared by many people
New Auto-Interp
Negative Logits
bil
-0.72
obin
-0.69
Tro
-0.63
mosp
-0.63
foremost
-0.62
steal
-0.60
rout
-0.60
sequ
-0.58
gart
-0.58
bid
-0.57
POSITIVE LOGITS
sake
1.24
purposes
1.17
reasons
0.89
ummies
0.89
occasions
0.79
foreseeable
0.77
eternity
0.72
duration
0.71
ulz
0.70
households
0.68
Activations Density 0.080%