Explanations
mathematical expressions or labels related to equations
Negative Logits
endpush         -0.53
transQ          -0.47
GHIJKLM         -0.46
GIVEREF         -0.45
privatisation   -0.45
newOwner        -0.43
concor          -0.40
ſta             -0.40
TokenNameDOT    -0.39
ec              -0.39
Positive Logits
label      3.45
label      2.89
Label      2.61
Label      2.61
LABEL      2.27
LABEL      2.25
labels     2.14
Labels     1.98
labels     1.96
labeling   1.96
Activations Density 0.023%