INDEX
    Explanations

    references to warfare and expressions of positivity

    Negative Logits
    Token             Logit
    " noqa"           -0.87
    "ERVIS"           -0.65
    " viewDidLoad"    -0.59
    "TRAILING"        -0.59
    "Slf"             -0.58
    "deepcopy"        -0.58
    "roek"            -0.57
    "aktor"           -0.53
    "getWriter"       -0.53
    ":✨"              -0.53
    Positive Logits
    Token             Logit
    " незавершена"    0.78
    " cherchés"       0.74
    "Geografi"        0.67
    "GEBURTSDATUM"    0.66
    " opposition"     0.63
    "Nice"            0.62
    " Decorative"     0.61
    "Beautiful"       0.60
    "rungsseite"      0.60
    " opponents"      0.59
    Activation Density 0.192%

    No Known Activations