INDEX
    Explanations

    commands or instructions being given or received

    words and phrases indicating communication or information exchange.

    New Auto-Interp
    Negative Logits
    cv
    -0.76
    uded
    -0.72
    alach
    -0.70
    wx
    -0.69
    til
    -0.67
    agos
    -0.66
    malink
    -0.66
    usat
    -0.66
    LGBT
    -0.64
    respective
    -0.64
    POSITIVE LOGITS
     him
    0.81
     me
    0.75
     politely
    0.70
     paramedics
    0.68
     kindly
    0.66
     us
    0.65
     them
    0.65
     Mahjong
    0.64
     angrily
    0.64
     '[
    0.64
    Act Density 0.304%

    No Known Activations