INDEX
Explanations
detailed descriptions or features of a product or service
Neuron Alignment
Index    Value    % of L₁
876      +0.09    0.3%
441      +0.09    0.2%
1252     +0.08    0.2%
Correlated Neurons
Index    P. Corr.    Cos Sim.
441      +0.09       0.04
1859     +0.09       0.02
1368     +0.08       0.02
Negative Logits
LookAnd         -0.69
GEBURTSDATUM    -0.65
Televis         -0.65
hadas           -0.63
Rumuni          -0.63
Хьажоргаш       -0.61
Filmo           -0.60
Forrás          -0.60
Geografi        -0.59
lenker          -0.59
Positive Logits
shenan        1.18
snoopy        1.08
hairc         1.08
intersper     1.07
sophistic     1.01
simpsons      1.01
encomp        1.00
apprehen      1.00
depic         0.98
wikihow       0.98
Activations Density: 0.701%