INDEX
    Explanations

    first-person pronouns and expressions of personal experience or emotion

    Negative Logits
    μÎŃ
    -0.15
    ìĦ
    -0.13
    OrNull
    -0.13
    earch
    -0.13
    gger
    -0.13
    .documentation
    -0.13
    Ø·Ùģ
    -0.13
    /release
    -0.13
     TIMEOUT
    -0.12
    à¸ģลาà¸ĩ
    -0.12
    Positive Logits
     too
    0.23
     second
    0.23
     agree
    0.23
     Agree
    0.23
     Cong
    0.20
    agree
    0.20
     echo
    0.19
     agre
    0.19
    Cong
    0.19
     glad
    0.19
    Activation Density: 0.141%

    No Known Activations