    Explanations

    mentions of footwear, particularly boots

    Negative Logits
    Token          Logit
    "terday"       -0.71
    "udic"         -0.68
    "ught"         -0.68
    "Ĭ±"           -0.68
    "enced"        -0.68
    " NAD"         -0.67
    "encers"       -0.67
    " neurot"      -0.67
    "LD"           -0.66
    "kefeller"     -0.65
    Positive Logits
    Token          Logit
    "strap"        1.89
    "loader"       1.14
    "stra"         1.11
    "legged"       1.03
    "camp"         0.99
    "leg"          0.88
    "tails"        0.87
    " loader"      0.83
    "boot"         0.81
    "lake"         0.81
    Activation Density: 0.012%

    No Known Activations