Explanations
phrases that indicate emerging or developing entities
Negative Logits
ken        -0.17
endif      -0.15
adar       -0.15
нак        -0.15
Whip       -0.15
acher      -0.14
ched       -0.14
upright    -0.14
Compat     -0.14
ugar       -0.14
Positive Logits
/down      0.25
coming     0.23
down       0.23
-down      0.21
Coming     0.20
comer      0.19
coming     0.19
running    0.19
oming      0.18
down       0.18
Activations Density: 0.009%