INDEX
Explanations
references to a particular television show
mentions of specific television shows
Negative Logits
quez        -0.69
steel       -0.66
ntil        -0.65
solitary    -0.65
Lucia       -0.62
illard      -0.62
scarce      -0.62
Aval        -0.62
granite     -0.62
saline      -0.61
Positive Logits
runners     1.60
biz         1.56
runner      1.55
manship     1.14
mable       1.11
room        1.04
case        0.99
girls       0.97
downs       0.95
rooms       0.95
Activations Density: 0.043%