INDEX
    Explanations

    terms related to toxicity in a scientific context

    New Auto-Interp
    Negative Logits
    -0.58
    AGR
    -0.52
    hado
    -0.51
    Ucraina
    -0.51
     جغرافيا
    -0.51
    bleshooting
    -0.50
    AddTagHelper
    -0.50
    cri
    -0.49
    {}{}
    -0.49
    ydd
    -0.48
    POSITIVE LOGITS
    [toxicity=0]
    1.21
    Personendaten
    0.70
    ScopeManager
    0.69
     HasFactory
    0.69
     IndexPath
    0.68
    toxicity
    0.66
    }}/>
    0.65
    HomeAsUpEnabled
    0.65
    存于互联网档案馆
    0.62
    TargetException
    0.61
    Act Density 0.060%

    No Known Activations