INDEX
    Explanations

    sentences that convey positive evaluations or experiences

    New Auto-Interp
    Negative Logits
    Przypisy
    -0.63
    ̣i
    -0.57
    rophore
    -0.54
     dernières
    -0.52
    ValueStyle
    -0.51
    shafen
    -0.51
     tarvit
    -0.51
     varsa
    -0.51
    كذا
    -0.50
     الدولى
    -0.49
    POSITIVE LOGITS
     quite
    2.88
    quite
    2.48
     very
    2.48
     fairly
    2.29
     Quite
    2.27
     pretty
    2.24
    Quite
    2.21
     bastante
    2.06
    very
    1.95
     khá
    1.91
    Act Density 1.273%

    No Known Activations