INDEX
    Explanations

    mathematical symbols and notations, particularly involving dollar signs indicating equations or variables

    New Auto-Interp
    Negative Logits
     iſt
    -0.96
     Reſ
    -0.90
     Eſ
    -0.86
     Anſ
    -0.85
    ſelves
    -0.85
    ly
    -0.83
     Inſ
    -0.81
     ſy
    -0.80
    ••••
    -0.80
     ſind
    -0.78
    POSITIVE LOGITS
    \}$
    1.09
     }}$
    1.07
    }$
    1.05
    ]$
    1.04
    }]$
    1.03
    )$
    1.03
     )}$
    1.03
    }\}$
    1.03
    )}$
    1.02
    ]}$
    1.02
    Act Density 0.495%

    No Known Activations