INDEX
Explanations
numerical values related to dates, numbers, and codes
New Auto-Interp
Negative Logits
isman
-0.65
etimes
-0.65
aned
-0.64
scrut
-0.64
endors
-0.63
knowledgeable
-0.63
ulously
-0.62
emade
-0.60
snowball
-0.60
relentless
-0.59
POSITIVE LOGITS
609
1.07
708
1.06
409
1.03
307
1.02
703
1.02
504
1.02
646
1.02
706
1.01
805
1.01
394
1.01
Activations Density 2.103%