INDEX
    Explanations

    instances of gratitude or expressions of thanks

    New Auto-Interp
    Negative Logits
    ynn
    -0.18
     Jad
    -0.17
    елик
    -0.16
     ded
    -0.16
    addock
    -0.15
     coat
    -0.15
     Ded
    -0.15
    äre
    -0.15
    _factory
    -0.15
     Dün
    -0.14
    POSITIVE LOGITS
    omik
    0.15
    нÑĥл
    0.15
    ono
    0.14
    HEME
    0.14
    enha
    0.14
    iyim
    0.14
    رÙģØª
    0.14
     (;;
    0.14
    amo
    0.14
    ãĤ¹ãĤ¿ãĥ¼
    0.13
    Act Density 0.037%

    No Known Activations