INDEX
    Explanations

    following "in" or "including"

    New Auto-Interp
    Negative Logits
    AboutDlg
    0.39
    átky
    0.38
    而已
    0.36
    قتصاد
    0.34
    0.34
    creas
    0.34
    تمع
    0.33
     girdi
    0.33
    รษฐกิจ
    0.33
     públicas
    0.33
    POSITIVE LOGITS
    0.40
     בו
    0.37
    boo
    0.36
    Stern
    0.36
    గ్ర
    0.36
     ವಿಧಾನಸ
    0.35
    DING
    0.35
    0.35
     अधिकार
    0.35
    Nord
    0.34
    Act Density 0.006%

    No Known Activations