INDEX
    Explanations

    occurrences of the word "exchange" and its variants in various contexts

    New Auto-Interp
    Negative Logits
    li
    -0.16
    amate
    -0.15
    azzo
    -0.15
    odore
    -0.15
    fil
    -0.15
    led
    -0.14
    anners
    -0.14
    ouden
    -0.14
    ¬
    -0.14
    arga
    -0.14
    POSITIVE LOGITS
    frau
    0.17
    anter
    0.17
    ept
    0.15
    esin
    0.15
    able
    0.15
    istrovstvÃŃ
    0.15
    ois
    0.14
    esa
    0.14
    iros
    0.14
    CompleteListener
    0.14
    Act Density 0.019%

    No Known Activations