INDEX
Explanations
symbolic representations or visual elements in the document
Negative Logits
purpoſe
-0.79
leaſt
-0.78
ſelves
-0.77
uſed
-0.74
reaſon
-0.74
becauſe
-0.72
greateſt
-0.71
poffible
-0.70
uſe
-0.70
ſen
-0.70
Positive Logits
.$,
0.72
OFDb
0.71
اریخ
0.66
Tembelea
0.63
}{*}{
0.63
AnchorStyles
0.60
}{*}{}
0.58
surla
0.57
/}
0.56
ⓧ
0.56
Activations Density 0.266%