INDEX
Explanations
announcement texts with specific event details and invitation language
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.22
0.9%
2019
+0.11
0.4%
752
+0.08
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
600
+0.22
0.06
1659
+0.11
0.06
1263
+0.08
0.06
Negative Logits
<bos>
-2.51
ⓧ
-1.04
/**
-1.03
<?
-1.03
-0.97
/***
-0.88
/*
-0.84
<?
-0.81
disbur
-0.74
/*!
-0.72
POSITIVE LOGITS
seksi
1.11
keramik
1.00
marte
0.97
silikon
0.96
kafe
0.95
nomine
0.89
santiago
0.88
catég
0.87
kredi
0.87
maroc
0.87
Activations Density 0.061%