INDEX
    Explanations

    the word "told" and its variations, indicating information-sharing or communication

    New Auto-Interp
    Negative Logits
    eteria
    -0.17
    dic
    -0.17
    ôn
    -0.15
    Ø
    -0.15
     ongoing
    -0.14
    ίγ
    -0.14
    azer
    -0.14
    dere
    -0.14
    762
    -0.14
    etine
    -0.13
    POSITIVE LOGITS
    zim
    0.16
    ĺħ
    0.15
    orian
    0.15
    exion
    0.15
    اÙĪÙĨد
    0.15
     id
    0.14
    AVA
    0.14
    cky
    0.13
    robots
    0.13
    berger
    0.13
    Act Density 0.018%

    No Known Activations