INDEX
Explanations
mentions of a specific date within a larger context
punctuation or sentence endings
New Auto-Interp
Negative Logits
Notting
-0.99
die
-0.79
Prev
-0.71
Bundes
-0.68
Compton
-0.67
Sing
-0.67
Rating
-0.67
Authors
-0.66
Wa
-0.66
Failed
-0.65
POSITIVE LOGITS
enos
0.77
asts
0.64
atz
0.64
amphib
0.62
oids
0.62
ixir
0.61
asio
0.61
transform
0.61
igans
0.60
prisons
0.60
Activations Density 0.000%