INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    skou
    -0.18
    ein
    -0.17
    sam
    -0.16
    ecast
    -0.16
    enerator
    -0.16
    yper
    -0.15
    sb
    -0.15
    ÃĹ↵↵
    -0.15
    sit
    -0.15
     entr
    -0.15
    POSITIVE LOGITS
    dives
    0.39
    ibu
    0.33
    awi
    0.32
    vern
    0.30
    tes
    0.29
    abar
    0.28
    nutrition
    0.27
    function
    0.26
    practice
    0.26
    aga
    0.25
    Act Density 0.011%

    No Known Activations