INDEX
Explanations
references to licenses and legal information within text
Negative Logits
Token   Logit
l       -1.08
l       -0.90
ll      -0.65
la      -0.64
le      -0.64
la      -0.60
l       -0.60
le      -0.60
ⅼ       -0.59
ll      -0.58
Positive Logits
Token   Logit
Л       1.05
Lo      1.01
LL      1.00
L       0.97
LC      0.95
Ли      0.93
LI      0.93
LR      0.91
Li      0.90
LF      0.89
Activations Density: 1.309%