INDEX
Explanations
actions or intentions related to pursuing goals or interests
Negative Logits
eba        -0.16
ÌĤ         -0.15
kola       -0.15
olina      -0.14
-gnu       -0.14
irie       -0.14
(blank)    -0.14
agna       -0.14
ris        -0.14
oli        -0.14
Positive Logits
pursue     0.19
pursued    0.17
ä¸ĭåİ»     0.17
ç²¾åĵģ     0.16
pursuit    0.15
purs       0.15
pursuing   0.15
acro       0.15
otos       0.15
Purs       0.15
Activations Density 0.013%
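
Some of the token strings listed here (for example `ä¸ĭåİ»` and `ç²¾åĵģ`) look like raw byte-level BPE vocabulary entries rather than readable text. The source does not name the tokenizer, so the following is only a minimal sketch that assumes a GPT-2-style byte-to-unicode mapping; `decode_token` is a hypothetical helper name, not part of any dashboard API.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2-style table mapping each byte (0-255) to a printable character."""
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is assigned the next free code point from 256 upward.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: printable character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Hypothetical helper: turn a vocabulary string back into readable UTF-8 text.

    Assumes the token uses the GPT-2-style byte mapping; a character outside
    that table raises KeyError.
    """
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    # Tokens may hold only part of a multi-byte character, so replace bad bytes.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ä¸ĭåİ»", "ç²¾åĵģ", "pursue"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, the two non-ASCII positive-logit entries decode to the Chinese strings 下去 and 精品, while plain ASCII tokens such as `pursue` are unchanged.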