INDEX
Explanations
the presence of the word "create" in various forms and contexts
New Auto-Interp
Negative Logits
'
-0.55
und
-0.54
For
-0.51
and
-0.51
sel
-0.48
likely
-0.48
,
-0.46
in
-0.45
elf
-0.45
plained
-0.44
POSITIVE LOGITS
create
2.02
CREATE
1.03
creation
1.01
متعلقه
0.94
creates
0.94
creation
0.93
create
0.92
creat
0.92
creazione
0.92
Create
0.90
Activations Density 0.083%