INDEX
    Explanations
    Negative Logits
    foreign           -0.07
     місце            -0.07
    цент              -0.07
    dataset           -0.07
    queues            -0.07
     donations        -0.06
    ugu               -0.06
     runs             -0.06
     running          -0.06
     قلب              -0.06
    Positive Logits
    -components       0.06
    \HttpFoundation   0.06
     Communications   0.06
     Gio              0.06
     Prosecutor       0.06
     embarrassment    0.05
    CCCCCC            0.05
                      0.05
     viele            0.05
     inflater         0.05
    Activation Density: 0.021%

    No Known Activations