INDEX
Explanations
performance
The neuron detects words referring to a product’s effectiveness (e.g. “effective,” “effectiveness”) and related quality descriptors.
Negative Logits
grill
-0.07
ra
-0.07
perience
-0.06
enschaft
-0.06
buf
-0.06
apoptosis
-0.06
URLException
-0.06
nnen
-0.06
不足
-0.06
elapsed
-0.06
Positive Logits
روسی
0.07
mastur
0.07
(D
0.07
seiz
0.07
takeover
0.06
oud
0.06
intense
0.06
digital
0.06
transparent
0.06
حت
0.06
Activations Density 0.026%