INDEX
Explanations
phrases related to resilience and determination
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
478
+0.10
0.3%
1533
+0.09
0.3%
690
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1672
+0.10
0.03
1531
+0.09
0.04
1543
+0.08
0.04
Negative Logits
<bos>
-0.72
Juf
-0.71
convenable
-0.69
Intere
-0.65
quelquefois
-0.63
Expt
-0.62
habituellement
-0.61
pixabay
-0.59
hacia
-0.58
después
-0.57
POSITIVE LOGITS
tremendously
0.75
incredibly
0.75
absolutely
0.74
incess
0.73
extremely
0.70
immensely
0.67
tremendous
0.65
notori
0.65
everywhere
0.65
very
0.64
Activations Density 0.538%