    Explanations

    references to emotional or physical pain

    Negative Logits
    Token       Logit
    " 風"       -0.15
    "iffin"     -0.15
    " Sink"     -0.15
    "iran"      -0.15
    "bai"       -0.15
    " Coff"     -0.14
    "ioc"       -0.14
    "i"         -0.14
    "íĴĪ"       -0.14
    "edList"    -0.14
    Positive Logits
    Token       Logit
    "usp"       0.15
    "ÑĸÑģÑĤ"    0.15
    "lessly"    0.14
    "imet"      0.14
    ".li"       0.13
    " Hilton"   0.13
    "mts"       0.13
    "nem"       0.13
    "Math"      0.13
    "ек"        0.13
    Activation Density 0.033%

    No Known Activations