INDEX
Explanations
phrases related to physical spaces, particularly rooms
references to specific rooms in various contexts
New Auto-Interp
Negative Logits
Iss
-0.70
Uz
-0.64
merce
-0.62
elson
-0.62
HER
-0.61
reconnect
-0.59
landfall
-0.58
ushima
-0.58
BAT
-0.58
Bonds
-0.58
POSITIVE LOGITS
room
1.10
occupancy
1.01
rooms
0.97
upstairs
0.92
divid
0.92
doors
0.92
rooms
0.88
itory
0.87
clock
0.87
lain
0.83
Activations Density 0.047%