INDEX
Explanations
dates formatted in the day-month-year structure
New Auto-Interp
Negative Logits
Genie
-0.59
hobbies
-0.55
explan
-0.54
expectations
-0.53
streng
-0.53
DN
-0.51
habitual
-0.51
ilater
-0.50
itton
-0.50
Alban
-0.50
POSITIVE LOGITS
th
1.87
rd
1.33
ths
1.11
teenth
1.11
nd
1.10
TH
1.06
tha
1.01
st
0.95
thus
0.93
thal
0.88
Activations Density 0.084%