Explanations
words related to external qualities and actions
Negative Logits
лин        -0.17
lez        -0.16
fulness    -0.16
риф        -0.15
aptor      -0.15
Nay        -0.15
Omn        -0.15
ql         -0.15
azzi       -0.15
INAL       -0.14
Positive Logits
ensive     0.31
remely     0.30
inction    0.29
/ext       0.26
rem        0.26
ext        0.25
(ext       0.24
ending     0.24
ender      0.23
ention     0.23
Activation Density: 0.012%