Explanations
conversational cues and questions about relationships and emotions
Text followed by conjunctions like "but", "if", "so"
the way it is
Negative Logits
Token      Logit
&&         -0.72
arvio      -0.62
mihi       -0.60
">—        -0.59
―――――      -0.58
elbise     -0.58
}>         -0.58
[          -0.56
прочем     -0.56
           -0.55
Positive Logits
Token      Logit
Its        1.51
its        1.48
Its        1.44
theres     1.32
thats      1.21
i          1.20
shes       1.14
its        1.11
Thats      1.08
dont       1.05
Activations Density: 0.534%