INDEX
    Explanations

    expressing admiration for your work

    New Auto-Interp
    Negative Logits
    চ্ছুক
    0.43
     Worse
    0.41
    ಬೇಕ
    0.41
     pertains
    0.40
     likely
    0.40
    ll
    0.39
     impair
    0.39
     গুরুতর
    0.39
     conceivably
    0.39
     expects
    0.39
    POSITIVE LOGITS
    0.55
     özellikle
    0.51
     simplicity
    0.50
     особенно
    0.49
     openness
    0.47
     originality
    0.47
     идея
    0.46
     стиль
    0.46
     രീതി
    0.45
    собенно
    0.45
    Act Density 0.058%

    No Known Activations