INDEX
Explanations
mentions of specific locations, most notably mountains
references to specific locations, events, or timelines in narratives
New Auto-Interp
Negative Logits
Cosponsors
-0.86
awi
-0.68
cleaners
-0.65
theless
-0.65
locations
-0.60
omever
-0.60
likes
-0.59
strengths
-0.59
ierrez
-0.58
qualities
-0.57
POSITIVE LOGITS
enium
0.77
millennium
0.70
iggurat
0.69
window
0.67
countdown
0.65
ilogy
0.65
\'
0.64
cycle
0.64
(~
0.64
yrus
0.63
Activations Density 0.475%