INDEX
Explanations
references to statues
mentions of statues
New Auto-Interp
Negative Logits
DN
-0.77
Coastal
-0.73
avez
-0.72
Raw
-0.70
CE
-0.68
ricular
-0.65
comes
-0.65
actic
-0.64
ells
-0.63
saline
-0.63
POSITIVE LOGITS
statue
1.38
statues
1.24
sculpture
1.15
sculptures
0.98
Statue
0.95
erected
0.92
monument
0.86
figur
0.83
honoring
0.81
mage
0.81
Activations Density 0.012%