INDEX
Explanations
mentions or references to TV shows, especially "The Daily Show"
mentions of television shows
New Auto-Interp
Negative Logits
membranes
-0.74
arta
-0.66
cogn
-0.65
liberties
-0.62
agric
-0.61
Ì
-0.60
paralle
-0.60
insofar
-0.60
RIC
-0.58
solvent
-0.58
POSITIVE LOGITS
Show
3.77
Show
2.43
SHOW
2.27
show
2.27
Shows
2.06
show
1.96
shows
1.76
Hide
1.25
shows
1.25
Hide
1.23
Activations Density 0.015%