INDEX
    Negative Logits
    Token          Logit
    "dire"         -0.09
    "gie"          -0.08
    "bibigay"      -0.08
    " mors"        -0.08
    " blocking"    -0.08
    "marine"       -0.07
    "kis"          -0.07
    "pap"          -0.07
    "/player"      -0.07
    "owan"         -0.07
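    Positive Logits
    Token              Logit
    "IK"               0.08
    "brook"            0.08
    (token missing)    0.08
    (token missing)    0.08
    (token missing)    0.08
    " electro"         0.08
    " conscience"      0.08
    (token missing)    0.07
    "用途"             0.07
    (token missing)    0.07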
    Activation Density: 0.006%

    No Known Activations