INDEX
Explanations
descriptions related to historical landmarks and cultural heritage sites
references to iconic landmarks and heritage sites
New Auto-Interp
Negative Logits
Ops
-0.85
pressure
-0.79
milo
-0.75
diapers
-0.74
mosquit
-0.73
inexperienced
-0.72
diarrhea
-0.71
testers
-0.70
-0.69
iless
-0.69
POSITIVE LOGITS
landmark
1.56
heritage
1.55
landmarks
1.54
monuments
1.50
monument
1.39
treasures
1.37
Historic
1.35
historic
1.34
architectural
1.28
relics
1.27
Activations Density 0.374%