INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    venants
    -0.76
    redit
    -0.75
     Borders
    -0.75
    ledged
    -0.73
    founded
    -0.69
    ITNESS
    -0.67
     Witness
    -0.67
    hovah
    -0.65
    urrencies
    -0.64
    rahim
    -0.62
    POSITIVE LOGITS
     dressing
    1.05
     greens
    1.02
     salad
    1.01
     salads
    0.98
     bowl
    0.96
    bowl
    0.94
     dress
    0.90
    eria
    0.88
     veggies
    0.83
     Bowl
    0.82
    Act Density 0.004%

    No Known Activations