INDEX
Explanations
phrases that indicate a balance between action and restraint
New Auto-Interp
Negative Logits
ifique
-0.15
.preferences
-0.14
dale
-0.14
amik
-0.14
ambio
-0.13
entifier
-0.13
å½»
-0.13
Buna
-0.13
Shortcut
-0.13
ushima
-0.13
POSITIVE LOGITS
reign
0.34
temper
0.34
tone
0.33
Reign
0.33
ton
0.32
dial
0.31
Tone
0.30
tempered
0.27
Ton
0.27
moderation
0.27
Activations Density 0.153%