INDEX
    Explanations

    phrases describing personal or professional actions taken against individuals

    New Auto-Interp
    Negative Logits
     bordeaux
    -0.97
     napoli
    -0.92
     milano
    -0.90
     lyon
    -0.89
     fuj
    -0.88
     écl
    -0.87
     ibiza
    -0.86
     oreo
    -0.86
     thermomix
    -0.86
     levis
    -0.85
    POSITIVE LOGITS
     been
    0.81
     become
    0.68
    been
    0.68
     had
    0.62
     BEEN
    0.61
     come
    0.61
     gone
    0.61
     reportedly
    0.60
     has
    0.56
     already
    0.56
    Act Density 0.453%

    No Known Activations