INDEX
Explanations
references to ratios or comparisons between quantities
New Auto-Interp
Negative Logits
runk
-0.62
Nec
-0.62
Will
-0.60
PEND
-0.59
moveToFirst
-0.59
-0.58
Го
-0.58
Buk
-0.58
oinette
-0.57
\\
-0.57
POSITIVE LOGITS
ratio
2.38
Ratio
2.33
ratios
2.30
RATIO
2.25
ratio
2.15
Ratios
2.11
Ratio
2.06
ratios
1.88
RATIO
1.76
ration
1.29
Activations Density 0.063%