INDEX
Explanations
words related to locker rooms
references to locker rooms
New Auto-Interp
Negative Logits
nel
-0.76
ulhu
-0.76
hillary
-0.72
PsyNetMessage
-0.69
alez
-0.66
debian
-0.66
nom
-0.66
overc
-0.63
DonaldTrump
-0.62
Leilan
-0.62
POSITIVE LOGITS
locker
1.14
room
0.93
room
0.92
rooms
0.91
ysis
0.81
closet
0.81
drawer
0.80
lain
0.79
Room
0.77
bie
0.77
Activations Density 0.031%