INDEX
    Explanations

    references to nighttime events or themes

    New Auto-Interp
    Negative Logits
     afternoon
    -0.70
     morning
    -0.65
     evening
    -0.63
     afternoons
    -0.61
     mornings
    -0.60
    下午
    -0.60
     daytime
    -0.59
    Afternoon
    -0.59
     evenings
    -0.57
     lunchtime
    -0.57
    POSITIVE LOGITS
     shift
    0.64
    gown
    0.63
    ingale
    0.61
    shift
    0.55
    mar
    0.54
     Shift
    0.52
    cap
    0.52
     sky
    0.49
    Shift
    0.46
    caps
    0.45
    Act Density 0.112%

    No Known Activations