INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ябре
    -0.49
    rinhos
    -0.47
     Dan
    -0.47
     Lande
    -0.47
     sánh
    -0.46
    ibel
    -0.46
     CreateTagHelper
    -0.45
    user
    -0.44
     देखें
    -0.43
     Lan
    -0.43
    POSITIVE LOGITS
    ütün
    0.68
     continúas
    0.62
     صوتيه
    0.60
     Projektu
    0.59
     propOrder
    0.58
    INSEE
    0.57
     PopupWindow
    0.56
     بيها
    0.54
     flesta
    0.54
     gesteld
    0.52
    Act Density 0.005%

    No Known Activations