INDEX
Explanations
instances of the word "leap" where the activation value is high
instances of the word "leap" in various contexts
Negative Logits
essee         -1.08
gel           -0.80
Interstitial  -0.78
ividually     -0.70
pmwiki        -0.67
ĻĤ            -0.66
liction       -0.66
gew           -0.64
matter        -0.63
leanor        -0.62
Positive Logits
frog          1.14
leaps         1.02
leap          0.79
olicy         0.79
rack          0.76
Leap          0.74
rers          0.73
Rivals        0.69
fruit         0.69
forward       0.68
Activations Density: 0.017%