    Negative Logits
    Token       Logit
    "oba"       -0.12
    "ary"       -0.11
    "uba"       -0.10
    "tr"        -0.09
    "ishi"      -0.09
    "naires"    -0.09
    "resi"      -0.09
    "trer"      -0.09
    " warped"   -0.09
    " Hakk"     -0.08
    Positive Logits
    Token            Logit
    "(properties"    0.13
    " properties"    0.11
    "/functions"     0.10
    "eties"          0.10
    "(Properties"    0.10
    "Ùħد"            0.09
    "ouns"           0.09
    "(Property"      0.09
    "-properties"    0.09
    "íĸ¥"            0.09
    Activation Density: 0.021%

    No Known Activations