INDEX
Explanations
personality descriptions
This neuron responds to descriptive adjectives and phrases that signal confidence, assertiveness, and leadership qualities.
New Auto-Interp
Negative Logits
си
-0.07
Fram
-0.06
近
-0.06
Как
-0.06
перв
-0.06
Ngoài
-0.06
_MAP
-0.06
(extra
-0.06
_fg
-0.06
ア
-0.06
POSITIVE LOGITS
compiled
0.06
ruh
0.06
aravel
0.06
zdrav
0.06
ослав
0.06
бюджет
0.06
gladly
0.06
INS
0.06
金额
0.06
маш
0.06
Activations Density 0.091%