INDEX
    Explanations

    phrases related to usefulness and appreciation of resources or tools

    New Auto-Interp
    Negative Logits
    ontent
    -0.14
    lak
    -0.14
    Fizz
    -0.14
    ì¹Ļ
    -0.13
    Outlined
    -0.13
    à¥Įन
    -0.13
    олж
    -0.13
    .fhir
    -0.13
    èī
    -0.13
    acomment
    -0.13
    POSITIVE LOGITS
     useful
    0.68
     Useful
    0.59
     helpful
    0.58
     handy
    0.53
     usefulness
    0.52
     Helpful
    0.49
     полез
    0.48
     Handy
    0.44
     valuable
    0.44
     hữu
    0.41
    Act Density 0.285%

    No Known Activations