INDEX
Explanations
references to progress and advancements in various contexts
New Auto-Interp
Negative Logits
facts
-0.16
sonian
-0.15
cio
-0.15
tom
-0.15
icina
-0.15
ptic
-0.14
thing
-0.14
imits
-0.14
lesh
-0.14
ikel
-0.14
POSITIVE LOGITS
ion
0.23
ional
0.20
ivism
0.19
sing
0.19
ions
0.19
ses
0.19
elli
0.18
anches
0.17
ãĥ³ãĥĩ
0.16
-thinking
0.15
Activations Density 0.041%