INDEX
Explanations
dates written in a specific format
numerical dates and significant time markers
New Auto-Interp
Negative Logits
------------------------
-0.95
Huss
-0.83
horm
-0.81
Greenberg
-0.79
Magn
-0.78
Hasan
-0.77
Herm
-0.76
Hag
-0.76
hap
-0.75
Sherman
-0.73
POSITIVE LOGITS
2016
1.42
2016
1.31
16
1.11
16
0.95
166
0.89
1916
0.88
aton
0.82
016
0.80
166
0.80
大
0.78
Activations Density 0.279%