INDEX
    Explanations

    terms related to nourishment and replenishment

    New Auto-Interp
    Negative Logits
    onto
    -0.16
    akers
    -0.15
     wearing
    -0.15
     ic
    -0.15
    ability
    -0.15
    ually
    -0.14
    ordes
    -0.14
    atically
    -0.14
    ане
    -0.14
    plane
    -0.14
    POSITIVE LOGITS
    ishing
    0.68
    ished
    0.66
    ishment
    0.64
    ish
    0.60
    ishes
    0.54
    ISH
    0.50
    ishments
    0.48
    isher
    0.48
    ISHED
    0.47
    lish
    0.38
    Act Density 0.027%

    No Known Activations