INDEX
    Explanations

    specific explicit sexual activities and related terms

    Text near percentages, deletions, or sex

    New Auto-Interp
    Negative Logits
    ">//
    -0.66
    openzeppelin
    -0.55
    AutoScaleMode
    -0.54
    //
    -0.53
    ">—
    -0.53
     Sando
    -0.53
     Roskov
    -0.51
    -0.50
    alva
    -0.48
     Bellow
    -0.47
    POSITIVE LOGITS
     surla
    0.64
    ImageContext
    0.63
    ScopeManager
    0.56
    gypti
    0.52
    obenzene
    0.52
    الحياه
    0.51
    édrale
    0.51
     aikaa
    0.50
    ernalia
    0.49
    })}$
    0.49
    Act Density 0.038%

    No Known Activations