INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     WOM
    -0.08
    هوری
    -0.06
    kul
    -0.06
    บอล
    -0.06
     Hew
    -0.06
    kos
    -0.06
    enden
    -0.06
    owl
    -0.06
     lon
    -0.06
    Bounds
    -0.06
    POSITIVE LOGITS
     script
    0.10
    Script
    0.08
     Script
    0.08
     scripts
    0.08
    atat
    0.08
     SC
    0.07
     escri
    0.07
    .Script
    0.07
     parasite
    0.07
     SCRIPT
    0.07
    Act Density 0.017%

    No Known Activations