INDEX
Explanations
references to specific numerical values and statistics
New Auto-Interp
Negative Logits
ksam
-0.17
otlin
-0.17
oref
-0.16
orgot
-0.16
jspx
-0.15
vat
-0.15
eming
-0.14
cisi
-0.14
ogn
-0.14
/docs
-0.14
POSITIVE LOGITS
5
0.31
pent
0.28
Pent
0.28
äºĶ
0.27
five
0.27
five
0.26
-five
0.25
Five
0.25
ï
0.24
пÑıÑĤÑĮ
0.24
Activations Density 0.151%