INDEX
    Explanations

    references to fruit in various contexts

    Negative Logits
    }}}$        -0.60
    ρε          -0.57
    ("${        -0.52
     endfor     -0.52
     stiletto   -0.51
    nthe        -0.51
     Amazonas   -0.51
    }$}         -0.51
    glfw        -0.50
    versy       -0.50
    Positive Logits
     fruits     1.12
     fruit      1.11
     Fruit      1.04
    Fruits      1.00
     Fruits     0.98
     FRUIT      0.97
    Fruit       0.97
     fru        0.94
     orchards   0.92
    fruit       0.90
    Activation Density: 0.096%

    No Known Activations