INDEX
Explanations
phrases related to social support for families and causes
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.08
0.3%
122
+0.07
0.3%
888
+0.05
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1425
+0.08
0.04
1598
+0.07
0.04
727
+0.05
0.03
Negative Logits
<bos>
-1.77
ⓧ
-1.05
-0.82
/**
-0.80
<?
-0.77
/*
-0.69
<?
-0.65
Enllaços
-0.62
/***
-0.60
#
-0.60
POSITIVE LOGITS
families
1.90
Families
1.84
Families
1.68
families
1.68
jaya
1.54
bandung
1.50
wien
1.49
haup
1.45
Juf
1.42
pican
1.41
Activations Density 0.157%