INDEX
    Explanations

    occurrences of the word "put" and its variations

    New Auto-Interp
    Negative Logits
    ewire
    -0.17
    itez
    -0.16
    haust
    -0.16
    eden
    -0.15
    inventory
    -0.15
    stras
    -0.15
    ока
    -0.15
     Luft
    -0.14
    ening
    -0.14
    509
    -0.14
    POSITIVE LOGITS
    nam
    0.21
    atively
    0.19
    aker
    0.18
     puts
    0.18
    ative
    0.17
    AKER
    0.17
     put
    0.17
     emphasis
    0.17
    tin
    0.16
    put
    0.16
    Act Density 0.064%

    No Known Activations