INDEX
Explanations
references to a specific individual named Stefan
New Auto-Interp
Negative Logits
landing
-0.16
us
-0.15
am
-0.15
AMB
-0.15
uw
-0.15
plane
-0.15
wik
-0.15
lando
-0.15
stash
-0.14
red
-0.14
POSITIVE LOGITS
lucent
0.17
jiang
0.15
åĨĨ
0.15
SCRIBE
0.15
itsu
0.15
uptools
0.14
CCR
0.14
udded
0.14
ãĤ¯ãĥĪ
0.14
asaki
0.14
Activations Density 0.014%