INDEX
Explanations
phrases indicating agreement or disagreement in conversation
New Auto-Interp
Negative Logits
евиÑĩ
-0.18
alian
-0.15
avy
-0.15
ายà¸Ļ
-0.15
ancode
-0.14
roz
-0.14
nemonic
-0.14
instein
-0.14
emark
-0.14
.called
-0.14
POSITIVE LOGITS
yes
0.41
YES
0.41
Yes
0.40
yes
0.37
YES
0.36
Yes
0.34
NO
0.32
_yes
0.31
"Yes
0.30
“Yes
0.29
Activations Density 0.073%