Explanations
instances of the word "windows"
references to "windows" in various contexts
Negative Logits
avez        -0.80
ghan        -0.72
IGH         -0.72
ARA         -0.71
oku         -0.67
zin         -0.66
zing        -0.64
REDACTED    -0.64
yss         -0.63
recomm      -0.62
Positive Logits
windows     1.02
wip         1.01
windows     0.98
pane        0.96
window      0.85
sill        0.85
glass       0.84
bars        0.83
door        0.82
openings    0.81
Activations Density 0.010%