INDEX
Explanations
actions of transforming or converting things into something else
phrases about transformation or conversion processes
New Auto-Interp
Negative Logits
cation
-0.77
inately
-0.73
ritz
-0.72
erity
-0.70
ran
-0.69
ername
-0.68
enance
-0.66
antage
-0.65
raint
-0.64
no
-0.64
POSITIVE LOGITS
usable
0.88
something
0.73
profitable
0.65
ãĥ¼ãĥ
0.64
quished
0.64
AFTA
0.62
a
0.61
Obj
0.61
surrogate
0.59
productive
0.59
Activations Density 0.071%