INDEX
Explanations
websites, online games, and gaming portals
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.20
1.0%
1870
+0.13
0.7%
783
+0.12
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
783
+0.20
0.12
1870
+0.13
0.07
2036
+0.12
0.08
Negative Logits
<bos>
-3.21
ⓧ
-1.07
/**
-0.86
négociations
-0.83
/*
-0.80
-0.78
//{
-0.73
Източници
-0.72
/***
-0.70
Fordítás
-0.69
POSITIVE LOGITS
impra
1.46
madonna
1.39
maneu
1.37
jaya
1.36
maroc
1.31
snoopy
1.30
unden
1.30
unce
1.29
bandung
1.29
panama
1.29
Activations Density 1.631%