INDEX
    Explanations

    references to rooms or living spaces

    New Auto-Interp
    Negative Logits
    )";
    
    -1.07
    !")
    
    -0.99
    )');
    -0.99
    )";
    -0.92
    ]';
    -0.90
    ()]
    
    -0.90
    )");
    
    -0.89
    </tfoot>
    -0.87
    -0.86
     ")";
    -0.86
    POSITIVE LOGITS
     Room
    1.84
     rooms
    1.79
     Rooms
    1.77
     room
    1.70
     ROOM
    1.67
    Room
    1.65
    Rooms
    1.64
    rooms
    1.56
    room
    1.47
    ROOM
    1.43
    Act Density 0.037%

    No Known Activations