INDEX
Explanations
places and landscapes related to mountains
references to mountainous locations
New Auto-Interp
Negative Logits
tle
-0.75
NER
-0.70
INAL
-0.70
IAL
-0.69
ABLE
-0.67
andum
-0.64
ASED
-0.62
ership
-0.62
absentee
-0.60
phas
-0.60
POSITIVE LOGITS
mith
1.02
hooting
1.01
ides
0.99
creen
0.98
cape
0.95
challeng
0.89
hips
0.88
ight
0.87
hed
0.87
cale
0.86
Activations Density 0.052%