INDEX
Explanations
relevant information about legal, court, and public policy matters
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.24
1.3%
172
+0.15
0.8%
1708
+0.12
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
172
+0.24
0.12
1708
+0.15
0.10
645
+0.12
0.10
Negative Logits
<bos>
-3.06
/***
-0.87
///**
-0.66
Související
-0.64
/***
-0.60
/*@
-0.59
/**
-0.58
Vegeu
-0.55
mobilize
-0.55
declare
-0.55
POSITIVE LOGITS
milf
1.19
maneu
1.17
pixar
1.14
perfet
1.14
greate
1.13
fortn
1.11
chrysler
1.11
affor
1.10
madonna
1.08
swarovski
1.07
Activations Density 0.345%