INDEX
Explanations
references to global or international contexts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.34
1.2%
2034
+0.20
0.7%
227
+0.10
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
2034
+0.34
0.06
1438
+0.20
0.05
227
+0.10
0.05
Negative Logits
<bos>
-2.07
///**
-0.76
/***
-0.74
maig
-0.66
ɵɵelement
-0.65
//{
-0.61
febr
-0.61
Marzo
-0.61
Molto
-0.60
interessa
-0.59
POSITIVE LOGITS
ecru
0.96
cuck
0.93
Middles
0.89
ankara
0.89
maneu
0.89
outlander
0.88
Inhabitants
0.87
alps
0.85
unwarran
0.84
truction
0.84
Activations Density 0.258%