INDEX
Explanations
conversational text speaking about certainty, planning, or intent.
Two-character strings
New Auto-Interp
Negative Logits
-0.57
démocr
-0.57
courants
-0.56
fermés
-0.56
↵↵
-0.56
détru
-0.55
..."
-0.55
cérami
-0.54
)$\\
-0.54
écou
-0.54
POSITIVE LOGITS
ISupport
0.74
+
0.64
dm
0.63
hq
0.62
hp
0.61
bg
0.60
bs
0.59
Fx
0.59
FF
0.58
hb
0.58
Activations Density 52.238%