INDEX
    Explanations

    expressions or mentions of gratitude towards support

    expressions of gratitude and mentions of support

    New Auto-Interp
    Negative Logits
    ãĥ£
    -0.70
    kered
    -0.66
    vern
    -0.63
     pores
    -0.63
    iren
    -0.63
     Hebdo
    -0.63
     sweat
    -0.62
     contrad
    -0.61
     dare
    -0.60
     unbeliev
    -0.60
    POSITIVE LOGITS
    Support
    0.78
     support
    0.77
    hesis
    0.75
    heses
    0.75
    orship
    0.75
    bands
    0.75
     Supports
    0.75
    enza
    0.74
    asio
    0.72
    ament
    0.71
    Act Density 0.051%

    No Known Activations