INDEX
Explanations
mathematical and technical symbols or notations
Negative Logits
bkz      -0.88
trin     -0.82
Noth     -0.79
trin     -0.77
pst      -0.74
GTR      -0.73
Gwend    -0.72
Zin      -0.72
Lizzy    -0.72
ׂ        -0.72
Positive Logits
.]       0.97
]].      0.93
],       0.91
_]       0.90
]]       0.89
].       0.86
],       0.85
!]       0.84
}]       0.82
$]$      0.81
Activations Density 0.411%