INDEX
    Explanations

    numeric values in text

    phrases indicating notable physical structures or objects

    New Auto-Interp
    Negative Logits
    afety
    -0.63
    vironment
    -0.62
    abases
    -0.60
     contingency
    -0.59
     sqor
    -0.59
     Policies
    -0.58
    humans
    -0.57
    posts
    -0.57
    ©¶æ¥µ
    -0.57
    ships
    -0.57
    POSITIVE LOGITS
     Shutterstock
    0.66
     symbol
    0.64
     adorned
    0.60
     emblem
    0.59
     Doodle
    0.56
     prest
    0.56
     reminiscent
    0.55
     pierced
    0.55
     haunting
    0.54
     pier
    0.54
    Act Density 1.496%

    No Known Activations