INDEX
    Explanations

    references to "gut" and its variations, indicating focus on instinctive or physical reactions and their relevance in various contexts

    New Auto-Interp
    Negative Logits
    awl
    -0.15
    istrovstvÃŃ
    -0.15
    stry
    -0.15
    lite
    -0.14
    екаÑĢ
    -0.14
    forcer
    -0.14
    qrt
    -0.14
    zell
    -0.13
    ibble
    -0.13
    รว
    -0.13
    POSITIVE LOGITS
    ting
    0.18
    achen
    0.18
    ted
    0.15
     Plug
    0.14
    icol
    0.14
     TextAlign
    0.14
    usan
    0.14
    å·¦åı³
    0.14
    ler
    0.14
    chan
    0.14
    Act Density 0.005%

    No Known Activations