INDEX
    Explanations

    references to popular culture and social commentary

    New Auto-Interp
    Negative Logits
     Whilst
    -1.14
    Whilst
    -1.14
     utilising
    -0.93
     poichè
    -0.91
     whilst
    -0.88
     utilise
    -0.83
    favourable
    -0.80
     endeavour
    -0.79
     endeavours
    -0.79
    hésite
    -0.78
    POSITIVE LOGITS
     pols
    0.77
     ain
    0.73
     bof
    0.67
     herewith
    0.66
     lousy
    0.66
     darn
    0.66
     …)
    0.66
     gee
    0.65
     ...)
    0.64
     sez
    0.63
    Act Density 0.901%

    No Known Activations