INDEX
    Explanations

    question opening adjectives

    New Auto-Interp
    Negative Logits
    Advantages
    0.69
     özelli
    0.68
    )].
    0.65
    Successful
    0.65
     використа
    0.65
    advantages
    0.63
     özellikleri
    0.63
    Benefits
    0.62
     azok
    0.61
     제외
    0.61
    POSITIVE LOGITS
     big
    1.59
     goodies
    1.24
     nasty
    1.24
     sexy
    1.21
     murky
    1.21
     gooey
    1.19
     juicy
    1.18
     scary
    1.17
     messy
    1.17
    big
    1.16
    Act Density 0.225%

    No Known Activations