INDEX
    Explanations

    instances of criticism or condemnation

    New Auto-Interp
    Negative Logits
    ErrorException
    -0.15
    ptal
    -0.14
    WithData
    -0.14
     tavs
    -0.14
    виÑħ
    -0.13
    morgan
    -0.13
    OutOfBoundsException
    -0.13
    ά
    -0.13
     Ø¥ÙĦÙĬÙĩ
    -0.13
     erotik
    -0.13
    POSITIVE LOGITS
     for
    0.72
    for
    0.55
    	for
    0.47
    สำหร
    0.40
     voor
    0.39
     untuk
    0.39
     für
    0.38
     за
    0.38
     för
    0.37
     για
    0.35
    Act Density 0.108%

    No Known Activations