INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    bes
    -0.18
    ning
    -0.17
    fully
    -0.16
    ners
    -0.16
    esen
    -0.16
    ally
    -0.15
    iger
    -0.15
    sects
    -0.15
    mie
    -0.15
     taj
    -0.14
    POSITIVE LOGITS
    house
    0.28
    houses
    0.24
     beans
    0.24
     grounds
    0.23
     bean
    0.22
    bean
    0.21
     Bean
    0.21
    HOUSE
    0.21
    beans
    0.21
    /es
    0.20
    Act Density 0.009%

    No Known Activations