INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    aira
    -0.08
     Global
    -0.07
    fait
    -0.07
    ARA
    -0.07
     крит
    -0.07
    Holiday
    -0.07
    _grupo
    -0.07
    عر
    -0.06
    irma
    -0.06
    @NoArgsConstructor
    -0.06
    POSITIVE LOGITS
     Pen
    0.18
     pen
    0.18
    Pen
    0.13
     pens
    0.13
    pen
    0.12
     penned
    0.11
     Penn
    0.10
     PEN
    0.09
     Pens
    0.09
    .pen
    0.09
    Act Density 0.010%

    No Known Activations