Explanations
references to specific dates and locations
Negative Logits
onomy      -0.65
redo       -0.64
Silence    -0.62
hra        -0.61
silence    -0.59
gif        -0.59
zed        -0.58
sed        -0.58
peat       -0.58
rieg       -0.57
Positive Logits
FINE        0.80
tourist     0.71
Canal       0.69
Liberties   0.68
quartered   0.66
orate       0.66
ember       0.64
Railway     0.64
rainy       0.64
ukong       0.63
Activations Density 2.721%