INDEX
Explanations
dates written in a specific format (full weekday, month, day, year)
New Auto-Interp
Negative Logits
unpre
-0.76
Prelude
-0.69
apprehension
-0.66
bottleneck
-0.66
psy
-0.65
manifold
-0.65
revived
-0.64
Metallic
-0.64
doubling
-0.63
flares
-0.63
POSITIVE LOGITS
isdom
1.34
alking
1.33
orst
1.33
idespread
1.30
restling
1.30
atson
1.29
esley
1.29
orthy
1.25
izards
1.25
olves
1.25
Activations Density 0.029%