INDEX
    Explanations

    phrases related to communication and interaction, likely with emotional undertones

    New Auto-Interp
    Negative Logits
    ãĥ¼ãĥĨ
    -0.71
     Downs
    -0.69
     recogn
    -0.68
     Borough
    -0.66
     interf
    -0.65
    uckland
    -0.62
     Thomson
    -0.62
     WhatsApp
    -0.61
     dispers
    -0.61
    hots
    -0.60
    POSITIVE LOGITS
    Ļ
    1.44
    ¡
    1.18
    Ķ
    1.16
    ĺ
    1.12
    ł
    1.11
    «
    1.09
    ĸ
    1.08
    ĵ
    1.08
    Ń
    1.08
    ľ
    1.07
    Act Density 0.254%

    No Known Activations