Explanations
the word "there" occurring with high activation values
the repeated phrase "there" indicating emphasis on existence or presence in various contexts
Negative Logits
shoot      -0.65
Honour     -0.64
ONSORED    -0.62
Heights    -0.59
ship       -0.58
uously     -0.57
tnc        -0.57
full       -0.56
isable     -0.56
ée         -0.54
Positive Logits
abouts     1.48
etheless   0.98
upon       0.95
fore       0.81
after      0.75
are        0.74
aren       0.72
FORE       0.72
choes      0.71
enty       0.71
Activations Density: 0.108%