INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     swept
    -0.09
    омина
    -0.08
     molti
    -0.08
    σύ
    -0.08
     autop
    -0.08
    .Dependency
    -0.07
    -0.07
     highways
    -0.07
     winding
    -0.07
     riesgos
    -0.07
    POSITIVE LOGITS
     Illegal
    0.09
    ീത
    0.08
     legal
    0.07
     legality
    0.07
    ായ
    0.07
     Environmental
    0.07
    Illegal
    0.07
    ೀತ
    0.07
     geraten
    0.07
     gehört
    0.07
    Act Density 0.003%

    No Known Activations