INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     currently
    -0.08
     Currently
    -0.08
     attacked
    -0.07
     atualmente
    -0.07
    Қ
    -0.07
    قى
    -0.07
     derzeit
    -0.07
     smarter
    -0.07
     actuellement
    -0.07
    оват
    -0.07
    POSITIVE LOGITS
     celu
    0.08
    äfte
    0.08
    -purpose
    0.08
    battery
    0.08
     amerik
    0.08
     http
    0.08
     वाली
    0.08
     रूपमा
    0.08
    OM
    0.07
    ahinta
    0.07
    Act Density 0.004%

    No Known Activations