INDEX
Explanations
references to specific buildings, particularly those known as palaces
references to "Palace" in various contexts
New Auto-Interp
Negative Logits
err
-0.71
arette
-0.70
schild
-0.69
CAST
-0.69
acted
-0.67
rss
-0.67
::::::::
-0.66
enegger
-0.66
went
-0.65
CHAT
-0.65
POSITIVE LOGITS
Palace
0.94
osaurs
0.79
ibur
0.77
intrigue
0.75
palace
0.74
gur
0.74
maiden
0.72
renovation
0.71
Hotel
0.70
holders
0.70
Activations Density 0.012%