INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     owns
    -0.08
    `;↵↵
    -0.08
     rumor
    -0.08
    )-(
    -0.07
    -0.07
     lập
    -0.07
    `;
    -0.07
    `.↵↵
    -0.07
     $,
    -0.07
    بلغ
    -0.07
    POSITIVE LOGITS
     flowers
    0.09
     Extraction
    0.09
     groenten
    0.09
     extraction
    0.09
    truncate
    0.08
    /live
    0.08
     légumes
    0.08
    .extract
    0.08
     extracts
    0.08
     trunc
    0.08
    Act Density 0.003%

    No Known Activations