INDEX
    Explanations

    Phrases indicating cause-and-effect relationships

    Negative Logits
     rangs          -0.42
    ミング            -0.42
    AxisAlignment   -0.40
    ULAR            -0.38
    coroutines      -0.37
     worthwhile     -0.37
    ďte             -0.36
     Tagged         -0.36
     Chriftian      -0.35
     ozna           -0.35
    Positive Logits
    GEBURTSDATUM    0.87
    ">//            0.84
    homonymie       0.82
     Мексичка       0.82
     ويكيميديا      0.79
    oredCriteria    0.78
    RegistryLite    0.77
    ArrowToggle     0.77
     تضيفلها        0.75
    wußt            0.75
    Activation Density: 1.158%

    No Known Activations