INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     emiss
    -0.07
     fung
    -0.07
     scant
    -0.06
     Feng
    -0.06
     Mars
    -0.06
     fungi
    -0.06
     spa
    -0.06
     JA
    -0.06
    キャ
    -0.06
    fad
    -0.06
    POSITIVE LOGITS
     double
    0.09
     Double
    0.08
    increase
    0.08
     Small
    0.08
    ้ม
    0.07
    UPER
    0.07
     dbl
    0.07
    Private
    0.07
    _double
    0.07
    ppe
    0.07
    Act Density 0.016%

    No Known Activations