INDEX
Explanations
the word "just" occurring in various contexts
Neuron Alignment
Index   Value   % of L₁
1177    +0.08   0.2%
78      +0.08   0.2%
1334    +0.08   0.2%
Correlated Neurons
Index   Pearson Corr.   Cos Sim.
78      +0.08           0.03
1133    +0.08           0.03
1334    +0.08           0.03
Negative Logits
pageNo        -0.75
itemList      -0.66
courseId      -0.65
requestId     -0.64
FFFF          -0.63
createDate    -0.63
michelin      -0.62
oakley        -0.61
vhs           -0.61
scrat         -0.60
Positive Logits
<bos>          0.61
Савезне        0.57
Conclusão      0.56
teater         0.51
ְׁ              0.51
smithy         0.50
Biografi       0.48
finished       0.48
معلومات        0.47
recentemente   0.47
Activations Density 0.156%
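
The density figure above is most likely the fraction of sampled token positions on which the feature has a nonzero activation. The source does not define it, so treat that reading as an assumption. A minimal sketch of the calculation under that assumption, using a hypothetical helper `activation_density` and toy data that do not come from this page:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means (active positions) / (all sampled positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    # Count positions where the feature fires above the threshold.
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy activations: 2 of 8 positions are active.
toy = [0.0, 0.0, 1.3, 0.0, 0.0, 0.0, 0.4, 0.0]
density = activation_density(toy)
print(f"{density:.3%}")  # prints 25.000%
```

Under this reading, a density of 0.156% would correspond to roughly 1 active position in every 640 sampled.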