INDEX
    Explanations

    Figure skating and gymnastics

    New Auto-Interp
    Negative Logits
    альних
    -0.08
    emption
    -0.07
    attack
    -0.06
    -0.06
     GAP
    -0.06
     Selection
    -0.06
    aux
    -0.06
     Bad
    -0.06
    victim
    -0.06
    анные
    -0.06
    POSITIVE LOGITS
    orrh
    0.07
    .ws
    0.06
     brows
    0.06
     PRODUCT
    0.06
    τοκ
    0.06
    0.06
    cb
    0.06
    ieved
    0.06
    unce
    0.06
    items
    0.06
    Act Density 0.002%

    No Known Activations