INDEX
Explanations
terms related to clothing and fashion trends
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1045
+0.10
0.3%
1328
+0.08
0.2%
68
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1385
+0.10
0.03
12
+0.08
0.02
1218
+0.07
0.02
Negative Logits
<bos>
-1.03
gov
-0.72
uesia
-0.68
</tbody>
-0.65
docs
-0.62
HideFlags
-0.61
후
-0.61
raise
-0.61
بتاريخ
-0.60
,
-0.60
POSITIVE LOGITS
closet
2.01
Closet
1.84
closet
1.78
closets
1.72
stockholm
1.68
sappi
1.56
lidl
1.56
affor
1.55
wien
1.54
maneu
1.54
Activations Density 0.138%