INDEX
    Explanations

    negative or problematic phrases related to health conditions

    New Auto-Interp
    Negative Logits
    tal
    -0.52
    '
    -0.49
     الاتحاد
    -0.48
    es
    -0.48
    cul
    -0.48
    -0.46
    end
    -0.46
    -0.45
    kal
    -0.45
    .
    -0.45
    POSITIVE LOGITS
    sizeCache
    1.12
    ]")]
    1.04
    ")));
    
    1.02
     referenties
    1.01
     الرياضيه
    1.01
    __":
    
    1.00
    complexContent
    0.98
    __":
    0.98
    脚注の使い方
    0.98
    انجليز
    0.97
    Act Density 0.177%

    No Known Activations