INDEX
Explanations
references to historical significance or properties
New Auto-Interp
Negative Logits
Hist
-0.51
Hist
-0.50
lot
-0.44
vivi
-0.41
.*")]
-0.41
●
-0.41
upkeep
-0.41
%%
-0.39
hist
-0.38
dec
-0.38
POSITIVE LOGITS
historical
0.93
holy
0.90
historical
0.85
historic
0.83
Holy
0.81
Historical
0.81
historic
0.81
históricas
0.78
Holy
0.77
historischen
0.77
Activations Density 0.082%