Explanations
the phrase "I'll" or its variations indicating future intent
Neuron Alignment
Index   Value   % of L₁
50      +0.33   1.8%
381     +0.13   0.7%
411     +0.12   0.7%
Correlated Neurons
Index   P. Corr.   Cos Sim.
411     +0.33      0.04
2004    +0.13      0.04
1415    +0.12      0.04
Negative Logits
Token      Logit
<bos>      -2.38
</h1>      -0.56
HasIndex   -0.55
en         -0.55
/***       -0.55
uni        -0.54
public     -0.54
/**        -0.53
ui         -0.53
مرئيه      -0.52
Positive Logits
Token      Logit
accla      1.48
reluct     1.47
maneu      1.45
affor      1.41
inev       1.36
shenan     1.35
disreg     1.34
disagre    1.33
unwarran   1.33
impra      1.31
Activations Density 0.070%