INDEX
    Explanations

    content related to political manipulation and unrest

    New Auto-Interp
    Negative Logits
    Tracce
    -0.58
     ]];
    -0.52
    dataProvider
    -0.52
    freopen
    -0.51
     défaite
    -0.48
     saltar
    -0.46
    ]));
    
    -0.46
    lismo
    -0.46
    setAccessible
    -0.46
    ]),
    
    -0.46
    POSITIVE LOGITS
    CppMethod
    0.75
    rungsseite
    0.68
    0.65
    FormTagHelper
    0.63
    üyada
    0.58
     يتيمه
    0.56
    fog
    0.55
     שוליים
    0.53
    findpost
    0.53
    ukunft
    0.51
    Act Density 0.149%

    No Known Activations