INDEX
Explanations
phrases that emphasize the importance of needs, obligations, and precautions in various contexts
New Auto-Interp
Negative Logits
themselves
-0.23
itself
-0.18
iedo
-0.15
undry
-0.15
createClass
-0.15
acro
-0.14
Ø®ÙĪØ¯Ø´
-0.14
antee
-0.13
ene
-0.13
yn
-0.13
POSITIVE LOGITS
yourself
0.30
yourselves
0.23
Yourself
0.20
можеÑĤе
0.18
guys
0.17
ırak
0.16
rott
0.15
strup
0.14
mue
0.14
nger
0.14
Activations Density 1.294%