INDEX
    Explanations

    expressions of positive sentiment and evaluations

    New Auto-Interp
    Negative Logits
    agan
    -0.14
     unforgettable
    -0.14
    ollar
    -0.13
    è¨Ģãģ£ãģŁ
    -0.13
     unfavor
    -0.13
     astonished
    -0.13
    íļ¨
    -0.13
    achten
    -0.13
    Argb
    -0.13
    olid
    -0.12
    POSITIVE LOGITS
     nice
    0.38
     heart
    0.31
    nice
    0.29
    Nice
    0.28
     grat
    0.28
     Nice
    0.27
     great
    0.26
     pleasing
    0.26
     encouraging
    0.26
     neat
    0.24
    Act Density 0.146%

    No Known Activations