    Explanations

    constructive feedback or criticism

    Negative Logits
    "plicht": 0.41
    "ื่อง": 0.39
    " مظ": 0.38
    "declare": 0.37
    "toire": 0.36
    "調べ": 0.36
    "宣言": 0.36
    " symmetrically": 0.36
    (token not captured): 0.35
    "ोसिएशन": 0.35

    Positive Logits
    " feedback": 3.27
    " Feedback": 2.98
    "Feedback": 2.95
    "feedback": 2.92
    "反馈": 2.69
    " feedbacks": 2.41
    " constructive": 1.91
    " critique": 1.75
    " critiques": 1.69
    " criticism": 1.66

    Act Density 0.118%

    No Known Activations