INDEX
    Explanations

    geometry, circles

    New Auto-Interp
    Negative Logits
     administrative
    -0.08
     vacc
    -0.08
    quipe
    -0.07
     Brexit
    -0.07
    ാമ
    -0.07
     professional
    -0.07
    vet
    -0.07
     multinational
    -0.07
    .lex
    -0.07
    .stringify
    -0.07
    POSITIVE LOGITS
     خورد
    0.08
     ellipse
    0.08
     halves
    0.08
     trope
    0.08
     ي
    0.07
     dipping
    0.07
     theorem
    0.07
     noirs
    0.07
     density
    0.07
     خالد
    0.07
    Act Density 0.023%

    No Known Activations