INDEX
Explanations
phrases mentioning actions related to windows
references to "windows"
New Auto-Interp
Negative Logits
ghan
-0.82
Downloadha
-0.78
doms
-0.76
ometown
-0.73
avez
-0.72
Flavoring
-0.71
zin
-0.70
icient
-0.70
agonist
-0.70
Haram
-0.70
POSITIVE LOGITS
pane
1.01
window
0.97
glass
0.96
windows
0.95
sill
0.94
wip
0.91
Window
0.89
window
0.87
ledge
0.82
edin
0.79
Activations Density 0.009%