INDEX
    Explanations

    specific nouns and terms related to activities and interactions

    New Auto-Interp
    Negative Logits
     kasarigan
    -0.64
    Geplaatst
    -0.58
    󠁴
    -0.53
     للاسماء
    -0.48
     surla
    -0.47
     Савезне
    -0.45
    ThemeOverlay
    -0.43
     الرياضيه
    -0.43
     *);
    -0.43
    wapV
    -0.42
    POSITIVE LOGITS
    1.19
    1.16
    sthe
    1.00
    们的
    0.88
    ‌ها
    0.86
    es
    0.83
    ss
    0.83
    들은
    0.81
    ́s
    0.81
    ssss
    0.77
    Act Density 3.097%

    No Known Activations