INDEX
Explanations
phrases related to sports teams and players
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.19
0.9%
1334
+0.11
0.5%
645
+0.09
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1334
+0.19
0.08
1415
+0.11
0.05
122
+0.09
0.06
Negative Logits
<bos>
-1.52
intersper
-1.37
endow
-0.85
gratify
-0.84
/***
-0.83
ascribe
-0.80
rouse
-0.78
banish
-0.77
harmonize
-0.77
acquaint
-0.76
POSITIVE LOGITS
venuto
0.98
dimentic
0.76
rechange
0.75
riuscito
0.72
rimasto
0.72
sentito
0.70
potuto
0.69
pymongo
0.68
chrysler
0.68
innamor
0.68
Activations Density 0.562%