INDEX
    Explanations

    phrases related to examining or assessing something closely

    Negative Logits
    " Appearance"     -0.36
    " appearance"     -0.35
    "Appearance"      -0.33
    "appearance"      -0.32
    " appearances"    -0.32
    " appearing"      -0.27
    "appear"          -0.27
    " appears"        -0.26
    " Appears"        -0.26
    " appear"         -0.25

    Positive Logits
    " look"           0.62
    "look"            0.50
    " Look"           0.49
    "Look"            0.45
    " LOOK"           0.43
    "_look"           0.42
    ".look"           0.42
    " looked"         0.41
    " looks"          0.41
    " looking"        0.41

    Act Density 0.068%

    No Known Activations