INDEX
Explanations
Occurrences of the term "Captain" in various contexts
Negative Logits
adoo         -0.17
ortic        -0.17
Specifier    -0.16
intree       -0.16
alus         -0.15
igne         -0.15
Naked        -0.15
binh         -0.15
.cent        -0.15
perf         -0.14
Positive Logits
cy           0.26
America      0.21
cies         0.20
kap          0.18
captain      0.17
Kirk         0.17
agn          0.17
amic         0.17
Captain      0.16
-command     0.16
Activations Density: 0.011%