Explanations
references to rooms or living spaces
Negative Logits
Token       Logit
)";         -1.07
!")         -0.99
)');        -0.99
)";         -0.92
]';         -0.90
()]         -0.90
)");        -0.89
</tfoot>    -0.87
)»          -0.86
")";        -0.86
Positive Logits
Token       Logit
Room        1.84
rooms       1.79
Rooms       1.77
room        1.70
ROOM        1.67
Room        1.65
Rooms       1.64
rooms       1.56
room        1.47
ROOM        1.43
Activations Density: 0.037%