INDEX
Explanations
mathematical expressions and notations
New Auto-Interp
Negative Logits
,
-0.60
-0.52
[]
-0.52
Bar
-0.51
void
-0.50
bar
-0.50
thâu
-0.48
w
-0.48
fan
-0.47
W
-0.47
POSITIVE LOGITS
avoient
0.97
feroit
0.96
étoit
0.93
auroit
0.90
étoient
0.88
}}}}
0.88
pouvoit
0.85
ainfi
0.81
}}}
0.81
)))))
0.81
Activations Density 1.056%