INDEX
Explanations
phrases related to confronting issues or situations
Neuron Alignment
Index   Value   % of L₁
1363    +0.11   0.4%
991     +0.10   0.3%
528     +0.10   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
991     +0.11      0.02
1604    +0.10      0.02
1142    +0.10      0.02
Negative Logits
Token      Logit
shenan     -0.90
maneu      -0.77
Incenti    -0.76
resear     -0.73
accla      -0.72
volunte    -0.72
reluct     -0.72
ineffec    -0.69
emphat     -0.69
depic      -0.69
Positive Logits
Token            Logit
confront         1.11
confronted       1.05
confronting      1.00
confrontation    0.99
confronts        0.85
faced            0.81
face             0.75
Confront         0.74
faced            0.70
facing           0.69
Activations Density: 0.103%