INDEX
Explanations
years related to historical events
references to significant historical years and events
New Auto-Interp
Negative Logits
amin
-0.83
umen
-0.82
ende
-0.82
iant
-0.78
axis
-0.71
hus
-0.70
tan
-0.70
lig
-0.70
Marco
-0.70
attr
-0.69
POSITIVE LOGITS
1916
1.11
1917
1.09
ĸļ
1.04
1918
1.00
1915
1.00
1914
1.00
1909
0.94
1906
0.90
1919
0.89
1905
0.89
Activations Density 0.012%