Explanations
concepts related to survival and essential needs
Negative Logits
тим     -0.07
失      -0.07
ť       -0.06
ngle    -0.06
烈      -0.06
alles   -0.06
dí      -0.06
ồng     -0.06
Tat     -0.06
seau    -0.06
Positive Logits
functioning   0.10
survival      0.08
life          0.08
lives         0.08
survive       0.08
生活          0.08
function      0.07
leben         0.07
function      0.07
жизни         0.07
Activations Density 0.025%