INDEX
    Explanations

    conversational cues and response patterns in dialogue

    New Auto-Interp
    Negative Logits
     kasarigan
    -0.70
    findpost
    -0.63
     فريبيس
    -0.62
    /*---
    -0.57
    gameserver
    -0.57
    }*/
    
    -0.56
    ("-");
    -0.56
     EconPapers
    -0.56
    twimg
    -0.55
     الرياضيه
    -0.55
    POSITIVE LOGITS
    Yeah
    0.88
     Yeah
    0.85
     yeah
    0.81
     exactly
    0.75
    Exactly
    0.74
     Exactly
    0.74
    Absolutely
    0.74
     absolutely
    0.71
    Totally
    0.68
    Oh
    0.68
    Act Density 0.049%

    No Known Activations