Explanations
mentions of the name "Nik" or variations thereof
Negative Logits
berman   -0.16
iger     -0.16
Trace    -0.15
pz       -0.15
uchs     -0.15
acht     -0.15
Gim      -0.15
trace    -0.14
.ops     -0.14
وات      -0.14
Positive Logits
itas     0.24
olas     0.23
laus     0.23
las      0.21
hil      0.20
iten     0.16
ita      0.16
kip      0.16
sic      0.15
ritz     0.15
Activations Density: 0.010%