INDEX
Explanations
references to physical rooms
references to physical spaces described as "rooms."
New Auto-Interp
Negative Logits
Iss
-0.70
reconnect
-0.68
thood
-0.65
vengeance
-0.65
ename
-0.64
rebellious
-0.63
neurot
-0.62
usterity
-0.61
Uz
-0.61
haps
-0.61
POSITIVE LOGITS
room
1.26
room
1.22
Room
1.10
Room
1.07
rooms
1.04
rooms
0.89
Rooms
0.89
bell
0.88
bryce
0.82
floor
0.81
Activations Density 0.020%