INDEX
Explanations
dialogue interactions between two characters, specifically focusing on the back-and-forth exchanges
New Auto-Interp
Negative Logits
escription
-0.87
ascus
-0.81
eatures
-0.78
inement
-0.72
æ©
-0.71
avorite
-0.71
abad
-0.70
irements
-0.69
lection
-0.67
velop
-0.66
POSITIVE LOGITS
Yeah
1.47
Alright
1.43
Hmm
1.31
Yeah
1.26
Exactly
1.25
Okay
1.23
Oh
1.17
Nope
1.12
Huh
1.12
Yep
1.10
Activations Density 0.100%