INDEX
    Explanations

    humor and quirky descriptions

    New Auto-Interp
    Negative Logits
    вання
    1.25
     centroids
    1.22
    IL
    1.19
    ंना
    1.15
    Trước
    1.14
    ол
    1.12
    र्चा
    1.12
    ंच्या
    1.12
     alarmed
    1.12
     כאשר
    1.11
    POSITIVE LOGITS
     grueling
    1.62
    y
    1.45
     witty
    1.42
     gooey
    1.36
     irrever
    1.36
     quirky
    1.34
     quintessential
    1.33
     humor
    1.32
     whimsical
    1.28
     goofy
    1.28
    Act Density 0.002%

    No Known Activations