Explanations
adverbs describing the degree or level of something
Neuron Alignment
Index   Value   % of L₁
1306    +0.12   0.4%
1325    +0.11   0.3%
1691    +0.10   0.3%
Correlated Neurons
Index   Pearson Corr.   Cosine Sim.
1691    +0.12           0.03
1306    +0.11           0.03
976     +0.10           0.03
Negative Logits
abstrait       -0.89
avancé         -0.77
exceptionnel   -0.75
keramik        -0.74
précieux       -0.73
Hez            -0.72
typique        -0.71
McKin          -0.70
réaliste       -0.69
industriel     -0.69
Positive Logits
fordable       0.69
relatively     0.68
lably          0.65
ighborhood     0.64
requently      0.64
recenti        0.63
nemia          0.63
@"";           0.62
vertisement    0.61
fairly         0.60
Activations Density: 0.084%