    Negative Logits
    Logit   Token
    -0.82   "AndEndTag"
    -0.80   "IsContent"
    -0.78   " ſeveral"
    -0.77   " Jefus"
    -0.77   "новниш"
    -0.75   " Monfieur"
    -0.74   "تفصیلات"
    -0.74   " myſelf"
    -0.74   " uſed"
    -0.74   " uſe"
    Positive Logits
    Logit   Token
    0.58    " image"
    0.52    " an"
    0.52    " ph"
    0.49    " images"
    0.49    " app"
    0.49
    0.48    " mental"
    0.47    "inverted"
    0.47    " virtual"
    0.43    " e"
    Activation Density: 0.004%

    No Known Activations