INDEX
Explanations
references to morning shows or programs
prominent television show titles and related phrases
New Auto-Interp
Negative Logits
wid
-0.64
illon
-0.60
USS
-0.59
hover
-0.57
guiActiveUn
-0.57
eligible
-0.56
Ter
-0.56
igham
-0.56
quet
-0.55
icum
-0.55
POSITIVE LOGITS
Practices
0.92
nered
0.88
outweigh
0.84
ounters
0.82
outwe
0.81
atoes
0.80
ornings
0.80
bye
0.70
anners
0.68
smanship
0.67
Activations Density 0.304%