INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Administrativna
    -0.59
    脚注の使い方
    -0.49
     propOrder
    -0.47
     дописавши
    -0.46
     ویکی‌پدی
    -0.46
    MockBean
    -0.46
     defStyleAttr
    -0.45
    awtextra
    -0.45
     beginnetje
    -0.44
    expandindo
    -0.44
    POSITIVE LOGITS
    före
    0.42
    futbolista
    0.41
    Spenden
    0.38
    /**
    0.37
     للاسماء
    0.36
    جغرافيا
    0.36
     boycot
    0.35
     depic
    0.35
     BorderSide
    0.34
     solidar
    0.34
    Act Density 0.077%

    No Known Activations