INDEX
Explanations
time-related events or actions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.19
0.9%
381
+0.14
0.7%
1535
+0.13
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1023
+0.19
0.08
442
+0.14
0.08
897
+0.13
0.08
Negative Logits
<bos>
-2.62
ⓧ
-1.13
<?
-0.83
/***
-0.80
/**
-0.79
/*
-0.75
-0.68
///**
-0.64
/**
-0.61
#![
-0.59
POSITIVE LOGITS
soulign
1.09
catég
1.02
Juf
1.01
maksi
1.00
Keny
0.99
jaya
0.98
Febru
0.95
Balik
0.95
mavi
0.95
fortn
0.95
Activations Density 0.711%