INDEX
Explanations
words related to the concept of attracting or being attracted to something
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.17
0.9%
25
+0.12
0.6%
1482
+0.12
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1038
+0.17
0.03
25
+0.12
0.03
1482
+0.12
0.03
Negative Logits
<bos>
-3.15
Vegeu
-0.86
ⓧ
-0.86
Referencer
-0.68
PutMapping
-0.68
/***
-0.67
SizeMode
-0.63
glMatrixMode
-0.62
Fordítás
-0.62
estacks
-0.62
POSITIVE LOGITS
lele
1.51
stockholm
1.51
wien
1.49
meis
1.46
aen
1.42
thut
1.39
mef
1.37
riva
1.36
uniqu
1.35
vry
1.34
Activations Density 0.120%