INDEX
Explanations
key numerical values and their context in the text
New Auto-Interp
Negative Logits
551
-0.15
ugo
-0.14
gba
-0.14
514
-0.14
adol
-0.14
qb
-0.14
allo
-0.13
contres
-0.13
itten
-0.13
olt
-0.13
POSITIVE LOGITS
surrounded
0.14
onus
0.14
/includes
0.14
cref
0.14
surround
0.13
Po
0.13
gesi
0.13
.cgi
0.13
ÑĩиÑģ
0.13
DW
0.13
Activations Density 0.021%