INDEX
    Explanations

    negative sentiment or adversely connotative phrases

    Negative Logits
    Autoritní   -0.60
    didSet      -0.60
    ngdoc       -0.57
    xFFFFFF     -0.56
     nargin     -0.54
    érience     -0.54
     latest     -0.53
     report     -0.52
    Vía         -0.51
    onSave      -0.50
    Positive Logits
     noix           0.52
     indisponible   0.51
    \}\\            0.50
    ècie            0.49
    ViewImports     0.48
    }>;             0.48
     beziehungs     0.48
     chi̍t           0.48
    __);            0.47
    __',            0.46
    Activation Density: 0.065%

    No Known Activations