INDEX
    Explanations

    the presence of the substring "ent"

    New Auto-Interp
    Negative Logits
    pent
    -0.17
    buz
    -0.16
    .criteria
    -0.15
    ibox
    -0.14
    .creation
    -0.14
    bai
    -0.14
    buah
    -0.14
     Spice
    -0.14
    aksi
    -0.14
     dyn
    -0.14
    POSITIVE LOGITS
    mit
    0.15
    ubits
    0.15
    ende
    0.15
    ippo
    0.14
    ish
    0.14
    Ïģιν
    0.14
    alam
    0.14
     Lage
    0.14
    chin
    0.14
    λλη
    0.14
    Act Density 0.000%

    No Known Activations