INDEX
Explanations
references to physical rooms and locations
references to specific room numbers and their context
New Auto-Interp
Negative Logits
HER
-0.74
Uz
-0.69
Iss
-0.67
merce
-0.64
Bonds
-0.63
pend
-0.63
elson
-0.62
usterity
-0.60
retribution
-0.55
pleting
-0.55
POSITIVE LOGITS
room
1.08
rooms
0.94
doors
0.93
rooms
0.91
upstairs
0.87
room
0.86
lain
0.86
divid
0.85
occupancy
0.84
itory
0.84
Activations Density 0.060%