INDEX
    Explanations

    words and phrases related to ratings and feedback

    New Auto-Interp
    Negative Logits
     تضيفلها
    -0.74
     in
    -0.66
     吗
    -0.60
    ValueStyle
    -0.59
    estan
    -0.59
     known
    -0.57
     समीक्षाओं
    -0.57
    ništ
    -0.56
     here
    -0.56
    simmon
    -0.56
    POSITIVE LOGITS
     Inſ
    0.91
     Reſ
    0.90
     ſche
    0.89
     auffi
    0.87
     ſtate
    0.85
     Houſe
    0.84
     Diſ
    0.83
     dezelve
    0.83
     myſelf
    0.82
     ſeveral
    0.79
    Act Density 0.210%

    No Known Activations