INDEX
Explanations
references to past events and connections
New Auto-Interp
Negative Logits
opak
-0.18
oti
-0.15
isclosed
-0.15
iej
-0.15
zeitig
-0.15
ocha
-0.14
thood
-0.14
thouse
-0.14
ieri
-0.14
entric
-0.14
POSITIVE LOGITS
dating
0.88
dates
0.85
date
0.77
dating
0.69
Dating
0.68
dates
0.67
dated
0.66
Dates
0.64
DATE
0.61
date
0.59
Activations Density 0.428%