INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Invoice
    -0.08
     acha
    -0.08
     افت
    -0.07
    .invoice
    -0.07
    (ignore
    -0.07
    invoice
    -0.07
    -0.07
     citizenship
    -0.07
     Question
    -0.07
    Receipt
    -0.07
    POSITIVE LOGITS
    বিশ
    0.10
    WER
    0.08
     fireworks
    0.08
     waterfalls
    0.08
     Canyon
    0.08
     Cuba
    0.08
     waves
    0.08
    ĵoj
    0.08
    ėjo
    0.08
    メリ
    0.08
    Act Density 0.002%

    No Known Activations