Explanations
terms that indicate approximation or roughness
Negative Logits
ola           -0.66
klart         -0.58
TV            -0.57
chaîne        -0.56
mela          -0.56
test          -0.55
cassert       -0.55
SuppressLint  -0.55
testa         -0.54
useState      -0.54
Positive Logits
Rough           1.25
__*/            1.18
rough           1.16
Rough           1.15
getRule         1.07
approximations  1.06
rough           1.05
Approximate     1.05
NameInMap       1.04
approximated    1.04
Activations Density: 0.153%