INDEX
    Explanations

    references to secrets and hidden information

    New Auto-Interp
    Negative Logits
     ddelwed
    -0.64
     ſind
    -0.63
    bibfield
    -0.60
     ſehen
    -0.60
     Spouse
    -0.59
     BorderSide
    -0.59
    UrlResolution
    -0.58
    ٞ
    -0.58
    spouse
    -0.58
     Waſſer
    -0.57
    POSITIVE LOGITS
     secret
    1.68
    Secret
    1.53
     Secret
    1.52
    secret
    1.50
     SECRET
    1.47
     secrets
    1.38
    SECRET
    1.27
     secreto
    1.25
     Secrets
    1.23
    Secrets
    1.20
    Act Density 0.175%

    No Known Activations