INDEX
Explanations
phrases related to providing instructions or steps
commanding phrases suggesting action or engagement
Negative Logits
bara          -0.81
quartered     -0.75
éĹ            -0.70
lied          -0.69
otten         -0.65
ELD           -0.64
KO            -0.63
ago           -0.63
alky          -0.62
pine          -0.60
Positive Logits
ourselves      1.07
together       0.71
OUR            0.68
querade        0.66
clarify        0.64
anew           0.64
aside          0.63
collectively   0.63
REAL           0.62
tomorrow       0.62
Activations Density: 0.102%