INDEX
    Explanations

    significant ethical criticism

    New Auto-Interp
    Negative Logits
    eniu
    0.70
    0.67
    dater
    0.64
    0.64
     करवाया
    0.63
    nés
    0.63
     랜덤
    0.61
     negotiable
    0.61
    ພາບ
    0.60
    嬿
    0.59
    POSITIVE LOGITS
     criticism
    3.90
    critic
    3.81
     critic
    3.74
     criticize
    3.69
     criticisms
    3.66
     Critic
    3.64
     Criticism
    3.63
     critique
    3.63
    Critic
    3.60
     crítica
    3.56
    Act Density 0.663%

    No Known Activations