    Explanations

    expressions of recommendation and interest towards engaging with content

    "Interest," "like," and positive sentiment

    interest, enjoyment, or usefulness

    Negative Logits
    Token                 Logit
    "thâu"                -0.54
    "Хьажоргаш"           -0.49
    "värr"                -0.49
    " Example"            -0.48
    " StatefulWidget"     -0.47
    "RenderAtEndOf"       -0.47
    " example"            -0.47
    "addCriterion"        -0.46
    "NameValuePair"       -0.45
    " exemplo"            -0.43
    Positive Logits
    Token                 Logit
    " interest"           0.93
    "interest"            0.90
    "Interest"            0.84
    " Interest"           0.82
    "interested"          0.79
    " interested"         0.78
    "INTEREST"            0.78
    " INTEREST"           0.74
    "useful"              0.72
    "Worth"               0.70
    Activation Density: 0.186%

    No Known Activations