INDEX
    Explanations

    specific words related to rooms like "room", "roommate", "roomm," and "rooming."

    New Auto-Interp
    Negative Logits
    HER
    -1.00
    indal
    -0.93
    ³³³³³³³³
    -0.90
    elson
    -0.89
    usterity
    -0.89
     Accessed
    -0.87
     Uz
    -0.86
    TAG
    -0.85
    FF
    -0.83
     RBI
    -0.82
    POSITIVE LOGITS
    room
    1.52
    rooms
    1.39
     doors
    1.34
     rooms
    1.22
     upstairs
    1.09
    stairs
    1.09
     room
    1.08
     floor
    1.06
    idges
    1.05
    clock
    1.03
    Act Density 0.576%

    No Known Activations