INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    -0.07
    еся
    -0.07
    Uuid
    -0.07
     prefer
    -0.06
    bsite
    -0.06
    blick
    -0.06
    equip
    -0.06
    ----------------------------------------------------------------
    -0.06
     ster
    -0.06
     поряд
    -0.06
    POSITIVE LOGITS
    most
    0.07
     laboratory
    0.07
     potentially
    0.07
     elabor
    0.06
    Localization
    0.06
     Neuroscience
    0.06
    ICS
    0.06
    0.06
     سود
    0.06
     особливо
    0.06
    Act Density 0.023%

    No Known Activations