INDEX
Explanations
connections and contrasts between different processes or concepts, particularly in scientific and technical contexts
New Auto-Interp
Negative Logits
None
-0.14
competitors
-0.14
none
-0.14
nothing
-0.14
.None
-0.14
anlık
-0.13
had
-0.13
None
-0.13
.directive
-0.13
already
-0.13
POSITIVE LOGITS
unlike
0.25
Common
0.22
Unlike
0.21
Unlike
0.21
TYPES
0.21
modern
0.20
wikipedia
0.20
generally
0.20
most
0.20
depending
0.19
Activations Density 0.351%