INDEX
Explanations
numeric values and numerical patterns
Negative Logits
old       -0.18
nds       -0.17
ounds     -0.17
nd        -0.17
/is       -0.16
ephir     -0.16
ãģĺ       -0.16
byss      -0.16
isper     -0.15
chwitz    -0.15
Positive Logits
teenth    0.24
teen      0.17
ëģĶ       0.17
-HT       0.16
th        0.16
bread     0.15
fold      0.15
rou       0.15
Thirty    0.15
ÐĨÐĨ      0.14
Activations Density 0.349%