INDEX
    Explanations

    positive descriptions of wines

    New Auto-Interp
    Negative Logits
    urette
    -0.15
    apter
    -0.14
    ilig
    -0.14
    ILA
    -0.14
     Norris
    -0.14
     voksne
    -0.13
    apel
    -0.13
    itre
    -0.13
    ipa
    -0.13
     Draft
    -0.13
    POSITIVE LOGITS
    iez
    0.15
    à¸Ľà¸£à¸°à¸ª
    0.15
    å©Ĩ
    0.15
    \core
    0.15
    ennis
    0.14
    ุà¹Ī
    0.14
     Dex
    0.14
    onde
    0.14
    inston
    0.14
    anzi
    0.14
    Act Density 0.002%

    No Known Activations