INDEX
    Explanations

    instances of deception or failed expectations

    Negative Logits
    "#"                   -0.72
    " ujednoznacz"        -0.62
    " protoimpl"          -0.58
    "Datuak"              -0.58
    " titolata"           -0.57
    " gatto"              -0.56
    ":]["                 -0.52
    " CreateTagHelper"    -0.51
    " enfans"             -0.51
    " figliu"             -0.51
    Positive Logits
    "bootstrapcdn"        0.59
    "PathVariable"        0.58
    "TagHelpers"          0.58
    "ابراین"              0.56
    "ಲ್ಲಿ"                0.54
    " UserDao"            0.53
    "LayoutInflater"      0.52
    " slags"              0.52
    " lock"               0.52
    "perti"               0.52
    Activation Density: 1.021%

    No Known Activations