Explanations
the word "screen" with different variations in the text
mentions of "screen" in various contexts
Negative Logits
doms       -0.77
ortium     -0.71
aternal    -0.71
ghan       -0.65
caution    -0.64
IGH        -0.62
ipop       -0.62
onomic     -0.61
antid      -0.61
Harriet    -0.60
Positive Logits
plays      1.35
printed    1.04
tops       1.02
writers    1.00
writer     0.99
grab       0.96
caps       0.96
TVs        0.94
screens    0.92
writing    0.91
Activations Density: 0.030%