INDEX
Explanations
expressions of approximation or near-certainty
New Auto-Interp
Negative Logits
lugs
-0.72
Guen
-0.66
bufio
-0.65
Magdeburg
-0.64
piele
-0.62
culoare
-0.62
GenerationType
-0.61
flamengo
-0.61
ṇ
-0.60
seamnă
-0.60
POSITIVE LOGITS
nearly
1.23
almost
1.14
Casi
1.09
Almost
1.07
Almost
1.05
Nearly
1.04
nearly
1.02
Casi
1.01
>\<^
0.97
Nearly
0.97
Activations Density 0.078%