INDEX
Explanations
mathematical symbols and notation
New Auto-Interp
Negative Logits
})*/
-0.89
://"
-0.74
>(
-0.68
>::
-0.68
*/)
-0.65
>")
-0.62
});*/
-0.61
"}"
-0.61
']}
-0.61
}^{-}$-0.61
POSITIVE LOGITS
Majefty
0.98
Anſ
0.94
greateſt
0.94
purpoſe
0.93
Reſ
0.90
ſever
0.90
himſelf
0.89
ſch
0.89
ſtate
0.88
ſta
0.88
Activations Density 0.071%