INDEX
Explanations
phrases indicating ease and simplicity in processes or actions
New Auto-Interp
Negative Logits
mart
-0.15
æ±
-0.14
dens
-0.14
unes
-0.14
digs
-0.13
compromise
-0.13
somehow
-0.13
æīį
-0.13
wers
-0.13
Strict
-0.13
POSITIVE LOGITS
easy
0.37
simple
0.35
easiest
0.32
easy
0.31
simple
0.31
simplicity
0.30
-simple
0.29
simples
0.29
einfach
0.28
/simple
0.28
Activations Density 0.207%