INDEX
Explanations
dialogue interactions between characters
New Auto-Interp
Negative Logits
icipated
-0.76
inement
-0.73
actionDate
-0.70
result
-0.68
inction
-0.68
rall
-0.65
conservancy
-0.64
İĭ
-0.64
¥ŀ
-0.63
arcity
-0.63
POSITIVE LOGITS
yeah
1.38
yeah
1.32
Yeah
1.24
kidding
1.20
sir
1.17
hhh
1.14
hhhh
1.12
Yeah
1.11
yea
1.08
fuck
1.07
Activations Density 4.530%