INDEX
Explanations
mentions of a television or radio program
references to television shows
New Auto-Interp
Negative Logits
etheless
-0.95
litter
-0.79
assing
-0.72
saline
-0.71
bilingual
-0.70
tactile
-0.69
mint
-0.69
axy
-0.69
notor
-0.69
redundancy
-0.68
POSITIVE LOGITS
biz
1.23
case
1.19
alter
1.13
cases
1.10
Tycoon
1.07
runners
1.01
ing
0.97
grounds
0.92
boat
0.90
runner
0.86
Activations Density 0.022%