INDEX
    Explanations

    descriptions of experiencing something physically

    instances of experiencing or tasting new things

    New Auto-Interp
    Negative Logits
     blaming
    -0.85
    redict
    -0.69
     citing
    -0.68
    ItemImage
    -0.67
     refrain
    -0.67
     refusing
    -0.67
     claiming
    -0.67
    advertising
    -0.66
     blamed
    -0.64
     comprom
    -0.64
    POSITIVE LOGITS
     firsthand
    1.31
     glimps
    0.84
     prototypes
    0.84
     demos
    0.83
     preview
    0.78
     scenery
    0.78
     whats
    0.77
     majesty
    0.77
     goodies
    0.73
     glimpse
    0.73
    Act Density 0.406%

    No Known Activations