INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    entrada
    -0.07
     lumin
    -0.07
     nob
    -0.07
    RU
    -0.06
     flexDirection
    -0.06
     rapid
    -0.06
     cyan
    -0.06
     alloy
    -0.06
     Kab
    -0.06
    atég
    -0.06
    POSITIVE LOGITS
     stove
    0.20
     stom
    0.16
     kite
    0.13
     Stamford
    0.12
    ove
    0.09
    OVE
    0.08
    Forge
    0.07
    \HttpFoundation
    0.07
    ่ว
    0.07
    Descri
    0.06
    Act Density 0.002%

    No Known Activations