INDEX
Explanations
references to pilots in various contexts
New Auto-Interp
Negative Logits
ity
-0.20
ITY
-0.17
rin
-0.17
hoff
-0.16
lider
-0.16
nal
-0.15
lv
-0.15
alborg
-0.15
elt
-0.15
alore
-0.15
POSITIVE LOGITS
pil
0.18
beam
0.16
fish
0.16
cy
0.15
enez
0.15
ÑģÑĤва
0.15
grams
0.15
Jeb
0.15
ERSIST
0.15
chap
0.15
Activations Density 0.009%