INDEX
    Explanations
    Negative Logits
    Token            Logit
    "InitVars"       -0.74
    "umumkan"        -0.71
    " Sequence"      -0.66
    "spac"           -0.62
    " suspending"    -0.61
    " suspensión"    -0.60
    " enrolling"     -0.60
    " sequences"     -0.59
    "TagMode"        -0.59
    "sequences"      -0.59
    Positive Logits
    Token            Logit
    " recipe"        1.78
    " Recipe"        1.64
    "recipe"         1.63
    " recipes"       1.60
    "Recipe"         1.51
    " Recipes"       1.46
    " RECIPE"        1.39
    "Recipes"        1.22
    "recipes"        1.22
    " RECIPES"       1.09
    Activation Density 0.061%

    No Known Activations