INDEX
    Explanations
    NEGATIVE LOGITS
    " Armen"        -0.07
    " Yaz"          -0.07
    " duyg"         -0.06
    "ebek"          -0.06
    "covers"        -0.06
    "isOk"          -0.06
    "DIV"           -0.06
    "Ign"           -0.06
    "_PRESENT"      -0.06
    "architecture"  -0.06
    POSITIVE LOGITS
    " rays"         0.07
    " dosud"        0.07
    "adget"         0.06
    "_gene"         0.06
    "_actor"        0.06
    "reshold"       0.06
    "(Point"        0.06
    "νε"            0.06
    "port"          0.06
    "ipi"           0.06
    Act Density 0.011%

    No Known Activations