INDEX
Explanations
references to air-related terms or concepts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1296
+0.21
0.8%
1379
+0.12
0.5%
892
+0.12
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1296
+0.21
0.06
892
+0.12
0.04
577
+0.12
0.03
Negative Logits
CascadeType
-0.59
numberWith
-0.47
detal
-0.44
Pot
-0.43
Tito
-0.41
oten
-0.40
مشين
-0.40
FetchType
-0.40
Monument
-0.40
Provid
-0.39
POSITIVE LOGITS
air
1.15
Air
1.08
Air
1.06
air
1.01
AIR
0.97
airs
0.84
AIR
0.81
airpods
0.77
idać
0.75
ypeł
0.74
Activations Density 0.069%