INDEX
    Explanations

    negative connotations or complaints about various subjects

    New Auto-Interp
    Negative Logits
    purl
    -0.79
     الحره
    -0.76
    ensaft
    -0.71
    TagMode
    -0.69
    ruvate
    -0.66
    fillColor
    -0.66
    ウィキ
    -0.64
    LayoutStyle
    -0.63
    Spoljašnje
    -0.62
     Daerah
    -0.61
    POSITIVE LOGITS
     -
    0.98
    ">-
    0.89
     '-
    0.87
    /-
    0.81
     "-
    0.79
    >-</
    0.78
    ..-
    0.78
    .-
    0.76
    ('-
    0.76
    ----------------
    0.73
    Act Density 0.072%

    No Known Activations