INDEX
Explanations
instances of the word "hike" or "hiking"
references to various types of hikes or increases
New Auto-Interp
Negative Logits
Slot
-0.72
icator
-0.70
Corpus
-0.70
binary
-0.69
liction
-0.69
Dialogue
-0.66
healed
-0.64
istar
-0.63
Saud
-0.62
ÑĮ
-0.61
POSITIVE LOGITS
hike
0.90
hiking
0.87
stakes
0.75
hikes
0.75
iking
0.74
biking
0.70
weed
0.69
jriwal
0.68
lift
0.67
otin
0.66
Activations Density 0.016%