INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     deposit
    -0.07
     fuss
    -0.07
     signup
    -0.06
    かれ
    -0.06
     bikes
    -0.06
    想去
    -0.06
    deps
    -0.06
     Buy
    -0.06
     employee
    -0.06
     --------------------------------------------------------------------------------
    -0.06
    POSITIVE LOGITS
    0.07
    0.07
     empres
    0.07
     factura
    0.07
    NDAR
    0.07
     Fib
    0.06
    0.06
    ガイド
    0.06
     COMPUT
    0.06
    مدار
    0.06
    Act Density 0.008%

    No Known Activations