INDEX
Explanations
specific names or attributes related to mountains
mentions of "mountain."
New Auto-Interp
Negative Logits
BACK
-0.85
lest
-0.82
Dialogue
-0.76
encia
-0.73
NER
-0.73
âĸ¬
-0.72
uel
-0.72
tle
-0.71
orate
-0.70
âĸ¬âĸ¬
-0.70
POSITIVE LOGITS
biking
1.13
mountain
0.97
Everest
0.89
ridge
0.88
climbers
0.88
skiing
0.86
footh
0.86
Sinai
0.85
mountains
0.82
laure
0.81
Activations Density 0.012%