INDEX
Explanations
phrases related to progressive and innovative thinking
Negative Logits
Token       Logit
kö          -0.18
pector      -0.17
olie        -0.16
Ìģc         -0.15
askell      -0.15
êu          -0.15
iber        -0.14
pecially    -0.14
MUX         -0.14
riad        -0.14
Positive Logits
Token       Logit
/back       0.35
-thinking   0.31
-facing     0.29
ward        0.29
ly          0.28
-looking    0.28
wards       0.28
/down       0.28
-leaning    0.24
most        0.24
Activations Density 0.049%