INDEX
    Explanations

    phrases related to dishonesty and insincerity

    New Auto-Interp
    Negative Logits
     createState
    -0.64
    ValueStyle
    -0.63
    adaptiveStyles
    -0.60
     jspb
    -0.59
     ComVisible
    -0.59
    RegistryLite
    -0.57
    +#+#
    -0.57
    jspx
    -0.56
    :+:
    -0.56
    <bos>
    -0.55
    POSITIVE LOGITS
     فريبيس
    0.59
     indisponible
    0.51
     whatever
    0.50
     volon
    0.48
    campista
    0.47
    abetes
    0.46
     חיצוניים
    0.45
     حل
    0.45
     bpy
    0.44
     Paused
    0.44
    Act Density 0.260%

    No Known Activations