INDEX
    Explanations

    instances of the word "eliminate" and its variants, indicating a focus on reducing or removing obstacles or risks

    New Auto-Interp
    Negative Logits
    olt
    -0.16
    embros
    -0.15
     gre
    -0.14
    eil
    -0.14
    综åIJĪ
    -0.14
     průbÄĽhu
    -0.14
    yonel
    -0.14
     vers
    -0.14
    uty
    -0.13
    ä¸įåΰ
    -0.13
    POSITIVE LOGITS
    æİī
    0.20
    /mit
    0.18
    aket
    0.17
    786
    0.16
    /min
    0.16
     entirely
    0.16
    /repos
    0.15
    کس
    0.15
    enders
    0.14
    /disable
    0.14
    Act Density 0.092%

    No Known Activations