INDEX
Explanations
dates, specifically those in the format of day followed by month followed by year
references to the year 199
New Auto-Interp
Negative Logits
orc
-0.90
aminer
-0.88
ensional
-0.81
orem
-0.74
anguage
-0.74
igent
-0.72
ellation
-0.72
arling
-0.71
ional
-0.71
heed
-0.70
POSITIVE LOGITS
th
1.06
âĸĪâĸĪ
0.93
09
0.87
61
0.86
05
0.86
08
0.86
07
0.84
03
0.82
059
0.82
06
0.82
Activations Density 0.025%