INDEX
Explanations
phrases indicating a desire for knowledge or seeking information
New Auto-Interp
Negative Logits
озв
-0.16
æ¨
-0.15
avec
-0.15
strand
-0.14
(æĹ¥
-0.14
upert
-0.14
adar
-0.14
onde
-0.14
ieg
-0.14
intl
-0.14
POSITIVE LOGITS
GetType
0.15
ellig
0.15
564
0.14
DIRECT
0.14
Bid
0.14
çļĦè¯Ŀ
0.14
learn
0.14
752
0.13
omba
0.13
ÅĻÃŃj
0.13
Activations Density 0.052%