INDEX
Explanations
dates in a particular format
numerical values associated with significant events or characteristics
New Auto-Interp
Negative Logits
dissu
-0.66
itters
-0.64
utical
-0.63
sacrificing
-0.60
deceived
-0.59
esty
-0.59
doct
-0.58
whine
-0.58
underestimate
-0.58
temptation
-0.58
POSITIVE LOGITS
Contents
1.05
³³³³³³³³³³³³³³³³
1.03
³³³³
1.03
³³³
1.02
³³³³³³³³
0.94
Born
0.88
ccording
0.88
Trivia
0.85
History
0.79
SPONSORED
0.79
Activations Density 0.463%