INDEX
Explanations
dialogue markers or quoted speech
quoted speech or dialogue
New Auto-Interp
Negative Logits
upstream
-0.87
pole
-0.78
downstream
-0.77
outsourcing
-0.77
frontline
-0.76
characterized
-0.71
isolated
-0.70
valued
-0.69
headquartered
-0.69
targeted
-0.69
POSITIVE LOGITS
Oh
1.26
Yeah
1.16
Hey
1.16
Fuck
1.14
Hmm
1.12
Yes
1.10
Sorry
1.10
Damn
1.07
Well
1.07
Alright
1.06
Activations Density 0.109%