INDEX
Explanations
words related to computer commands and online communication
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.10
0.5%
204
+0.06
0.3%
1618
+0.06
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1624
+0.10
0.05
8
+0.06
0.05
1310
+0.06
0.04
Negative Logits
<bos>
-1.43
<?
-1.07
-1.05
/**
-0.92
ⓧ
-0.87
/***
-0.84
<?
-0.83
///**
-0.78
/*!
-0.77
//{
-0.72
POSITIVE LOGITS
stockholm
1.58
wien
1.53
lidl
1.46
affor
1.42
lamborghini
1.42
ibiza
1.41
mef
1.40
squa
1.38
maneu
1.38
dises
1.38
Activations Density 0.345%