INDEX
    Explanations

    phrases related to information sharing and transfer

    New Auto-Interp
    Negative Logits
    xdb
    -0.17
    inherits
    -0.17
    ÑĤин
    -0.16
    gom
    -0.15
    uç
    -0.15
    -END
    -0.15
    ahren
    -0.15
    ãģıãĤĭ
    -0.14
    ãģ«ãģªãĤĭ
    -0.14
     tradi
    -0.13
    POSITIVE LOGITS
     already
    0.85
    already
    0.74
     Already
    0.72
    Already
    0.68
    å·²ç»ı
    0.63
    å·²
    0.59
     å·²
    0.53
    _already
    0.52
     Ñĥже
    0.52
     sudah
    0.51
    Act Density 0.590%

    No Known Activations