INDEX
    Explanations

    phrases related to conflicts, confrontations, and political opinions

    New Auto-Interp
    Negative Logits
    oltán
    -0.50
    lustre
    -0.47
    unehmen
    -0.46
    genicity
    -0.45
    علق
    -0.45
    liness
    -0.44
     IndexError
    -0.43
     IOError
    -0.43
     carelessly
    -0.43
    tragung
    -0.42
    POSITIVE LOGITS
     inder
    0.94
     thermomix
    0.94
     dovr
    0.92
     sappi
    0.91
     sopr
    0.90
     migli
    0.88
     solidar
    0.86
     dichi
    0.83
     dises
    0.83
     erec
    0.82
    Act Density 0.120%

    No Known Activations