INDEX
    Explanations

    sentences involving accusations and denials related to wrongdoing

    New Auto-Interp
    Negative Logits
     top
    -0.44
     non
    -0.43
     гр
    -0.42
    SuppressMessage
    -0.41
     ofrec
    -0.40
     zd
    -0.40
    tende
    -0.40
     pa
    -0.40
    Pa
    -0.40
    стей
    -0.39
    POSITIVE LOGITS
     EconPapers
    0.93
    LookAnd
    0.87
    GEBURTSDATUM
    0.85
    RenderAtEndOf
    0.83
     فريبيس
    0.80
     Majefty
    0.75
    MLLoader
    0.75
    RectangleBorder
    0.75
     Dicapai
    0.72
     ſtate
    0.72
    Act Density 0.811%

    No Known Activations