INDEX
Explanations
references to significant or impactful events or encounters
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
313
+0.09
0.3%
1492
+0.08
0.3%
1482
+0.07
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1363
+0.09
0.04
690
+0.08
0.04
78
+0.07
0.02
Negative Logits
<bos>
-1.12
/**
-0.71
<?
-0.70
-0.67
żdy
-0.59
/*
-0.57
<?
-0.57
/*!
-0.56
/***
-0.56
},[])
-0.56
POSITIVE LOGITS
stockholm
0.87
mosso
0.86
scrat
0.84
affor
0.82
Cru
0.81
olx
0.81
strick
0.77
maneu
0.76
mimi
0.76
pymysql
0.75
Activations Density 0.327%