Explanations
References to beer and related terms such as hops, brewery, or craft beer.
Neuron Alignment
Index    Value    % of L₁
920      +0.14    0.5%
1472     +0.14    0.5%
869      +0.12    0.4%
Correlated Neurons
Index    P. Corr.    Cos Sim.
920      +0.14       0.03
1472     +0.14       0.03
869      +0.12       0.03
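The two columns of the Correlated Neurons table measure different things, which is why a neuron can show a correlation of +0.14 but a cosine similarity of only 0.03. The sketch below is a minimal illustration, assuming that "P. Corr." denotes the Pearson correlation between feature and neuron activations over a set of tokens, and that "Cos Sim." is the cosine similarity between two direction vectors. The function names and the sample numbers are hypothetical and are not taken from this page.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length activation series."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (std_x * std_y)


def cosine(u, v):
    """Cosine similarity between two direction (weight) vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Hypothetical activations of a feature and a neuron over five tokens.
feature_acts = [0.0, 0.0, 1.0, 0.0, 2.0]
neuron_acts = [0.1, -0.2, 0.9, 0.0, 2.1]
print(round(pearson(feature_acts, neuron_acts), 3))

# Hypothetical direction vectors: orthogonal directions give cosine 0.
print(cosine([1.0, 0.0], [0.0, 1.0]))
```

Pearson correlation is computed after subtracting each series' mean, so it is insensitive to offsets and scale, whereas cosine similarity compares raw vector directions.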
Negative Logits
hek          -0.73
BeginInit    -0.67
krab         -0.65
republi      -0.61
minimalis    -0.61
meras        -0.59
Xuất         -0.58
Mathem       -0.56
Sucher       -0.55
psychiat     -0.55
Positive Logits
beer       1.31
Beer       1.20
BEER       1.15
beers      1.13
Beer       1.09
beer       1.07
brewery    0.93
beers      0.87
Beers      0.87
cerveza    0.79
Activations Density: 0.080%