INDEX
Explanations
instances of the word "company" and related terms
Neuron Alignment
Index   Value   % of L₁
1485    +0.10   0.3%
1978    +0.10   0.3%
270     +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
270     +0.10      0.06
1735    +0.10      0.05
792     +0.09      0.04
Negative Logits
adal     -0.95
lele     -0.94
Sén      -0.89
kac      -0.86
NKC      -0.83
hcm      -0.81
saha     -0.80
bera     -0.79
istan    -0.79
ché      -0.78
Positive Logits
Denote      0.63
Qualquer    0.63
😌          0.60
zonder      0.60
Einfach     0.57
withal      0.57
dintr       0.57
🙃          0.56
lmfao       0.56
geforce     0.56
Activations Density: 0.419%