INDEX
Explanations
ironic situations or events
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.19
0.7%
1370
+0.10
0.3%
492
+0.08
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
94
+0.19
0.04
1784
+0.10
0.04
682
+0.08
0.03
Negative Logits
<bos>
-1.83
/***
-0.68
<?
-0.67
</thead>
-0.60
ⓧ
-0.59
-0.56
/*!
-0.56
raise
-0.52
interface
-0.52
///**
-0.52
POSITIVE LOGITS
maneu
1.24
chrysler
1.16
gaily
1.16
mondeo
1.06
Minang
1.05
reluct
1.03
stockholm
1.02
ricardo
1.00
tolerably
0.99
isuzu
0.98
Activations Density 0.284%