INDEX
Explanations
phrases related to development and growth
Negative Logits
agar     -0.15
akh      -0.14
askell   -0.14
Hust     -0.14
Hut      -0.14
Duncan   -0.13
ее       -0.13
freshly  -0.13
Gast     -0.13
avis     -0.13
Positive Logits
bec       0.31
become    0.30
becomes   0.27
became    0.26
成了      0.25
Become    0.24
Become    0.24
Became    0.24
becoming  0.23
成为      0.21
Activations Density 0.264%