INDEX
Explanations
variable and equation representations in mathematical contexts
New Auto-Interp
Negative Logits
>>)
-0.15
lag
-0.15
');↵
-0.14
,:);↵
-0.14
ocale
-0.14
GOODMAN
-0.14
LAG
-0.14
tam
-0.14
velop
-0.13
aye
-0.13
POSITIVE LOGITS
)]
0.29
)}
0.21
)]↵
0.20
}}
0.19
)];
0.18
)}"↵
0.18
.")]↵
0.18
]]
0.18
)]↵
0.18
']}↵
0.17
Activations Density 0.133%