INDEX
    Explanations

    references to self-referential phrases or concepts

    New Auto-Interp
    Negative Logits
    الة
    -0.57
    ppelin
    -0.56
    %");
    -0.54
     signUp
    -0.52
    kaw
    -0.52
    chette
    -0.52
    %")
    -0.52
    ğer
    -0.51
    ...?"
    -0.51
    deko
    -0.51
    POSITIVE LOGITS
    Personendaten
    1.02
    AndEndTag
    0.95
    self
    0.93
    DoubleQuotes
    0.84
    __":
    0.75
    новништво
    0.74
     дописавши
    0.73
     AssemblyCulture
    0.71
     بيها
    0.70
    UserScript
    0.69
    Act Density 0.028%

    No Known Activations