INDEX
Explanations
the verb "is" and its variations representing states or conditions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.34
1.5%
1937
+0.12
0.5%
82
+0.10
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1937
+0.34
0.07
651
+0.12
0.04
1351
+0.10
0.05
Negative Logits
<bos>
-2.89
<?
-0.95
-0.84
/*
-0.74
ⓧ
-0.70
/**
-0.67
InvalidProtocol
-0.60
照
-0.59
companion
-0.59
#
-0.58
POSITIVE LOGITS
accla
1.56
maneu
1.54
milf
1.41
disgra
1.40
impra
1.39
increa
1.39
affor
1.35
hairc
1.34
scrat
1.33
guarante
1.30
Activations Density 0.251%