INDEX
Explanations
Positive adjectives
The neuron fires on subjective evaluative adjectives expressing positive judgment or praise (e.g., “cute,” “beautiful,” “adorable”).
New Auto-Interp
Negative Logits
记录
-0.06
.setFont
-0.06
shredded
-0.06
_THEME
-0.06
необхід
-0.06
десят
-0.06
redirectTo
-0.06
Lilly
-0.06
الانت
-0.05
레
-0.05
POSITIVE LOGITS
cà
0.08
deser
0.07
��
0.07
earm
0.07
Pek
0.06
017
0.06
Kah
0.06
había
0.06
aún
0.06
ninety
0.06
Activations Density 0.038%