INDEX
Explanations
information related to historical acts or legislative provisions related to education, specifically higher education and segregation
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1253
+0.12
0.4%
227
+0.11
0.3%
939
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
227
+0.12
0.07
1336
+0.11
0.05
919
+0.08
0.01
Negative Logits
fluo
-0.73
foon
-0.68
macrop
-0.68
juft
-0.68
ftre
-0.67
juf
-0.65
uncin
-0.65
bieber
-0.65
paff
-0.65
laft
-0.64
POSITIVE LOGITS
universities
0.89
colleges
0.87
campuses
0.86
university
0.78
Colleges
0.74
tuition
0.71
college
0.71
Universities
0.69
campus
0.69
graduates
0.69
Activations Density 0.630%