INDEX
    Explanations

    phrases expressing strong opinions or stances

    Negative Logits
    Token          Logit
    " hcm"         -0.99
    " palab"       -0.98
    " siena"       -0.94
    " thut"        -0.93
    " vne"         -0.93
    " santiago"    -0.93
    " fatis"       -0.92
    " nomine"      -0.92
    " parati"      -0.91
    " milano"      -0.90
    Positive Logits
    Token               Logit
    " ostavi"           0.56
    " trust"            0.53
    "viewing"           0.51
    " watching"         0.50
    " understand"       0.48
    " expect"           0.48
    " internetowa"      0.47
    "ometrial"          0.47
    " demand"           0.46
    " understanding"    0.46
    Activation Density: 0.402%

    No Known Activations