INDEX
    Explanations

    references to geometric shapes, specifically circles and related terms

    New Auto-Interp
    Negative Logits
    ntax
    -0.86
     pitié
    -0.80
    +#+
    -0.79
    niest
    -0.77
    <thead>
    -0.77
     Fordham
    -0.76
    MenuInflater
    -0.75
    zczegól
    -0.73
    isiaj
    -0.73
     Mahomet
    -0.73
    POSITIVE LOGITS
     Circle
    2.40
     circle
    2.39
     CIRCLE
    2.27
     circles
    2.26
    Circle
    2.23
     Circles
    2.12
    circle
    2.11
    CIRCLE
    1.94
    circles
    1.93
    Circles
    1.92
    Act Density 0.074%

    No Known Activations