INDEX
Explanations
The word "detection" or "detected" in various contexts, including academic papers, disease detection, and quarantine measures.
Neuron Alignment
Index    Value    % of L₁
50       +0.08    0.3%
1407     +0.08    0.3%
313      +0.07    0.3%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1448     +0.08       0.02
1250     +0.08       0.02
1041     +0.07       0.02
Negative Logits
<bos>      -1.19
public     -0.75
//         -0.72
.          -0.71
,          -0.70
/*         -0.69
/**        -0.69
↵↵         -0.66
add        -0.65
(blank)    -0.65
Positive Logits
affor        2.07
increa       2.01
maneu        1.96
fta          1.94
détect       1.92
detection    1.90
Detection    1.90
Juf          1.90
ftu          1.89
thut         1.89
Activations Density 0.097%