INDEX
Explanations
The neuron fires on phrases about engaging or communicating with alien life—especially verbs like “interact,” “communicate,” and related terms around ethical, respectful first‐contact guidelines.
New Auto-Interp
Negative Logits
τή
-0.07
im
-0.06
頻
-0.06
adopts
-0.06
عم
-0.06
ولوژی
-0.06
итив
-0.06
//!
-0.06
@class
-0.06
آخرین
-0.06
POSITIVE LOGITS
redistrib
0.06
UICollectionViewCell
0.06
жизнь
0.06
emoc
0.06
vene
0.06
MESSAGE
0.06
Nich
0.06
Tracy
0.06
disruption
0.06
asyonu
0.06
Activations Density 0.041%