INDEX
Explanations
phrases related to communication and working through issues
conversational cues indicating necessity or planning for future discussions
Negative Logits
"#         -0.84
xtap       -0.74
WATCHED    -0.71
"@         -0.67
CONCLUS    -0.65
ĺħ         -0.65
"{         -0.62
endi       -0.61
NFL        -0.61
ILCS       -0.61
Positive Logits
-"         1.88
..."       1.71
â̦"       1.66
—"         1.53
â̦"       1.34
!?"        1.32
?"         1.30
â̦."      1.27
?!"        1.25
.ãĢį       1.18
Activations Density 0.388%