INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     paralle
    -0.08
    -END
    -0.07
    Idx
    -0.07
     suited
    -0.07
    Ka
    -0.07
    agu
    -0.07
    -0.07
     tand
    -0.06
     veto
    -0.06
     strained
    -0.06
    POSITIVE LOGITS
     đàn
    0.06
     "*.
    0.06
     toolStrip
    0.06
     diện
    0.06
    IfExists
    0.06
    0.06
    อำ
    0.06
     câu
    0.06
    Connor
    0.06
    AD
    0.06
    Act Density 0.083%

    No Known Activations