INDEX
    Explanations

    terms related to deception or dishonesty

    New Auto-Interp
    Negative Logits
     CanadaChoose
    -0.61
     Applies
    -0.49
     keny
    -0.49
     recite
    -0.47
     organise
    -0.47
     respira
    -0.47
     Proven
    -0.46
    росло
    -0.46
     invent
    -0.46
     contradict
    -0.46
    POSITIVE LOGITS
    addGap
    0.65
    protoimpl
    0.65
     dis
    0.63
     мәкал
    0.60
     kuiten
    0.58
     epä
    0.56
    AnchorStyles
    0.56
     excru
    0.56
    addPreferredGap
    0.54
    MessageTagHelper
    0.52
    Act Density 1.925%

    No Known Activations