INDEX
    NEGATIVE LOGITS
     مرئيه  -0.54
     Minaj           -0.52
    writeFieldEnd    -0.50
    stateProvider    -0.48
    Aziz             -0.47
     Rina            -0.46
     getID           -0.45
     Jasmin          -0.45
     FMI             -0.45
     виправивши      -0.45
    POSITIVE LOGITS
    ough             2.73
    OUGH             1.96
    oughs            1.89
    ought            1.32
    rough            1.30
    oug              1.29
    ugh              1.28
    augh             1.23
     Hough           1.19
    UGH              1.14
    Activation Density: 0.007%

    No Known Activations