INDEX
    Explanations

    phrases indicating the use of deceptive or false information

    New Auto-Interp
    Negative Logits
    ConstraintMaker
    -0.67
    IntoConstraints
    -0.66
    migrationBuilder
    -0.60
    endphp
    -0.57
     BorderSide
    -0.55
    ]")]
    -0.54
    RenderAtEndOf
    -0.54
     EFE
    -0.54
    ^(@)
    -0.53
     abandonné
    -0.53
    POSITIVE LOGITS
    øk
    0.54
    Edited
    0.50
     Normdatei
    0.50
    FieldNumber
    0.49
     Италијани
    0.49
    OrBuilder
    0.48
     National
    0.48
     <=",
    0.48
    0.47
    وشن
    0.47
    Act Density 0.028%

    No Known Activations