INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     rozwiąz
    -0.08
     visas
    -0.08
     nier
    -0.07
     yapıl
    -0.07
    ệc
    -0.07
     filmer
    -0.07
     ate
    -0.07
     Procedure
    -0.07
     Cost
    -0.07
     Routine
    -0.07
    POSITIVE LOGITS
    力量
    0.12
     forças
    0.11
     forces
    0.11
     fuerzas
    0.10
     Forces
    0.10
     силы
    0.10
    0.08
     efforts
    0.08
     Kräfte
    0.08
     محتر
    0.08
    Act Density 0.012%

    No Known Activations