INDEX
    Explanations

    monetary values and percentages

    New Auto-Interp
    Negative Logits
    Ìģ
    -0.15
    ifa
    -0.15
    aje
    -0.14
    uary
    -0.14
    unda
    -0.14
    elib
    -0.14
    theses
    -0.14
    idine
    -0.14
    pyx
    -0.14
    ula
    -0.14
    POSITIVE LOGITS
    000
    0.24
    Ù¬
    0.15
    oom
    0.15
    002
    0.15
    lisi
    0.15
    ä¸ĩåĨĨ
    0.14
    Æł
    0.14
    heimer
    0.14
    006
    0.14
     dul
    0.14
    Act Density 0.012%

    No Known Activations