INDEX
Explanations
words related to the internet and technology
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
553
+0.13
0.5%
1677
+0.13
0.5%
1306
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
553
+0.13
0.04
1677
+0.13
0.03
196
+0.11
0.03
Negative Logits
oretical
-0.48
zdrow
-0.47
gainera
-0.47
actylus
-0.46
agonal
-0.45
Referential
-0.45
GARET
-0.43
edback
-0.43
izvē
-0.41
눠
-0.41
POSITIVE LOGITS
Internet
1.24
internet
1.23
Internet
1.21
internet
1.19
INTERNET
1.05
INTERNET
0.88
morire
0.78
Interne
0.75
credere
0.71
compréhen
0.69
Activations Density 0.063%