INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     kuk
    -0.06
    -0.06
     전문
    -0.06
    _fre
    -0.06
     rằng
    -0.06
    .enc
    -0.06
     Assigned
    -0.06
    -0.06
     frosting
    -0.06
     ==>
    -0.06
    POSITIVE LOGITS
    WOOD
    0.07
    kiego
    0.07
    _YELLOW
    0.07
    -package
    0.07
    ulus
    0.07
    apgolly
    0.06
    entence
    0.06
    romatic
    0.06
     CONDITION
    0.06
    kehr
    0.06
    Act Density 0.000%

    No Known Activations