Explanations
phrases related to effort or initiatives towards a goal
Negative Logits
Token      Logit
erland     -0.16
vala       -0.16
quito      -0.16
cznie      -0.15
alis       -0.15
ocrates    -0.15
ams        -0.15
yna        -0.15
ILLA       -0.15
erken      -0.14
Positive Logits
Token      Logit
erves      0.32
orts       0.31
iciency    0.31
luent      0.30
icient     0.29
fect       0.28
ort        0.27
ingham     0.26
iciencies  0.26
ector      0.26
Activations Density: 0.006%