INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     ammonia
    -0.09
     rainforest
    -0.08
    rowave
    -0.08
     Toch
    -0.08
    nova
    -0.08
     yen
    -0.08
    m
    -0.07
    Ban
    -0.07
     auk
    -0.07
    Basket
    -0.07
    POSITIVE LOGITS
    (inv
    0.08
    0.08
    0.08
    iciones
    0.07
    (letter
    0.07
     Etc
    0.07
    0.07
    _letter
    0.07
     Sang
    0.07
     letter
    0.07
    Act Density 0.001%

    No Known Activations