INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     picked
    -0.07
     scattering
    -0.07
     Matching
    -0.07
    appro
    -0.07
     tích
    -0.06
    ר
    -0.06
    elden
    -0.06
    -0.06
     matching
    -0.06
     wallets
    -0.06
    POSITIVE LOGITS
     во
    0.09
    онт
    0.07
    0.07
     Во
    0.07
    Во
    0.07
    (hObject
    0.06
     않고
    0.06
    (qu
    0.06
    (arguments
    0.06
     verb
    0.06
    Act Density 0.002%

    No Known Activations