INDEX
Explanations
phrases related to progress and forward movement
New Auto-Interp
Negative Logits
kö
-0.16
Ìģc
-0.15
olie
-0.15
à¥ľ
-0.14
kit
-0.14
elder
-0.14
vrai
-0.14
kowski
-0.14
kat
-0.14
vero
-0.14
POSITIVE LOGITS
ward
0.20
/back
0.20
wards
0.20
-thinking
0.19
/down
0.17
/up
0.16
forward
0.14
edList
0.14
ilenames
0.14
SSIP
0.14
Activations Density 0.042%