INDEX
    Explanations

    paragraphs in coding language

    opening parentheses in the text

    New Auto-Interp
    Negative Logits
     EVs
    -0.66
     Hemisphere
    -0.65
     ESV
    -0.62
     fung
    -0.62
     {*
    -0.60
     retali
    -0.59
     nond
    -0.58
     TTL
    -0.58
    orate
    -0.58
     Federation
    -0.57
    POSITIVE LOGITS
    catentry
    0.94
    onduct
    0.81
    hello
    0.77
    aber
    0.75
    english
    0.75
    igration
    0.75
    antic
    0.74
    bour
    0.73
    outh
    0.72
    unicip
    0.71
    Act Density 0.010%

    No Known Activations