INDEX
    Explanations

    references to bathroom-related items and activities

    New Auto-Interp
    Negative Logits
     Torch
    -0.15
    essor
    -0.15
    INUX
    -0.15
     Tribal
    -0.15
    èĥ¸
    -0.15
    ulace
    -0.14
    alaria
    -0.14
    pike
    -0.14
    dj
    -0.14
    Helmet
    -0.13
    POSITIVE LOGITS
     toilet
    0.36
     seat
    0.34
     bowl
    0.32
     toilets
    0.31
     Toilet
    0.31
     flushing
    0.30
     flush
    0.29
     bid
    0.28
     Seat
    0.27
    Seat
    0.27
    Act Density 0.006%

    No Known Activations