INDEX
    Explanations

    personal pronouns and relational phrases indicating involvement or connection

    expressions of personal feelings and opinions

    New Auto-Interp
    Negative Logits
     Bridgewater
    -0.69
     Amp
    -0.65
     Toast
    -0.64
     Verge
    -0.64
     Redmond
    -0.63
     Chao
    -0.63
     Davies
    -0.63
     Robinson
    -0.62
     Eliot
    -0.62
     Wellington
    -0.61
    POSITIVE LOGITS
    âĢ
    1.71
    ï¸ı
    1.18
    Ò
    1.13
    Â
    1.05
    âĪ
    1.05
    â
    1.00
    âĶ
    0.99
    ðŁĺ
    0.97
    ñ
    0.97
    »
    0.96
    Act Density 0.283%

    No Known Activations