INDEX
Explanations
mentions of specific grades in a school setting
Neuron Alignment
Index    Value    % of L₁
82       +0.08    0.3%
795      +0.08    0.3%
50       +0.08    0.3%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1354     +0.08       0.03
1377     +0.08       0.03
1421     +0.08       0.03
Negative Logits
<bos>          -0.88
public         -0.64
//             -0.62
(blank)        -0.61
/*             -0.60
continue       -0.59
be             -0.58
usercontent    -0.57
lập            -0.56
,              -0.56
Positive Logits
grade      2.58
Grade      2.43
grades     2.37
Grades     2.28
Grade      2.27
grade      2.21
GRADE      2.19
Grades     2.02
grades     1.96
grading    1.78
Activations Density 0.157%