Explanations
words related to processes of democratization and visibility in various contexts
Negative Logits
urf       -0.16
ness      -0.15
ach       -0.15
NESS      -0.15
anye      -0.15
alc       -0.14
cot       -0.14
lessly    -0.14
umuz      -0.14
iver      -0.14
Positive Logits
fend              0.16
.scalablytyped    0.16
boz               0.15
azor              0.15
áce               0.14
cce               0.14
ceae              0.14
/stretch          0.14
odega             0.14
ounters           0.14
Activations Density: 0.251%