INDEX
Explanations
mentions of words related to screens or scripts
references to "screen" or "screenwriters."
New Auto-Interp
Negative Logits
ipop
-0.74
ortium
-0.72
doms
-0.71
ghan
-0.71
Harriet
-0.69
deleg
-0.68
IGH
-0.67
ciating
-0.66
thens
-0.64
hypoc
-0.63
POSITIVE LOGITS
plays
1.28
screens
1.05
screen
1.04
screen
1.00
writers
0.97
writer
0.95
TVs
0.95
room
0.87
Screen
0.85
reens
0.83
Activations Density 0.012%