Explanations
Specific words related to rooms, such as "room", "roommate", "roomm", and "rooming".
Negative Logits
HER         -1.00
indal       -0.93
³³³³³³³³    -0.90
elson       -0.89
usterity    -0.89
Accessed    -0.87
Uz          -0.86
TAG         -0.85
FF          -0.83
RBI         -0.82
Positive Logits
room        1.52
rooms       1.39
doors       1.34
rooms       1.22
upstairs    1.09
stairs      1.09
room        1.08
floor       1.06
idges       1.05
clock       1.03
Activations Density: 0.576%