INDEX
    Explanations

    instances of deception or trickery

    Deception, trickery, or being misled

    New Auto-Interp
    Negative Logits
     înal
    -0.47
     gesteld
    -0.45
     betrekking
    -0.43
     uitges
    -0.42
     betreft
    -0.41
    forte
    -0.41
     Linton
    -0.41
     behov
    -0.41
     Tennyson
    -0.40
    ennia
    -0.40
    POSITIVE LOGITS
     trick
    0.98
     tricks
    0.90
     Trick
    0.79
     Tricks
    0.78
    trick
    0.76
     sneaky
    0.75
     tricked
    0.75
     trucos
    0.71
    Trick
    0.71
     scam
    0.69
    Act Density 0.462%

    No Known Activations