Explanations
restaurant or food-related terms marked as 'signature.'
Neuron Alignment
Index    Value    % of L₁
1178     +0.08    0.3%
1757     +0.08    0.3%
1492     +0.08    0.3%
Correlated Neurons
Index    P. Corr.    Cos Sim.
2040     +0.08       0.02
1494     +0.08       0.03
1690     +0.08       0.03
Negative Logits
ⓧ            -0.83
<bos>        -0.74
/***         -0.67
/*           -0.63
conquête     -0.62
///**        -0.60
/**          -0.59
avoid        -0.56
encourage    -0.55
//           -0.52
Positive Logits
signature     2.82
signatures    2.55
Signature     2.54
signature     2.51
Signatures    2.38
Signature     2.29
SIGNATURE     2.28
signatures    2.06
SIGNATURE     1.87
Signatures    1.83
Activations Density: 0.147%