INDEX
    Explanations

    references to specific plans or frameworks, often labeled as 'schemes'

    Negative Logits
    " EAT"       -0.82
    " Constan"   -0.74
    " Garry"     -0.71
    " Horton"    -0.69
    "mort"       -0.69
    " Ata"       -0.68
    "Visual"     -0.67
    " tq"        -0.66
    " Visual"    -0.65
    " faſt"      -0.64
    Positive Logits
    " scheme"    2.54
    "scheme"     2.50
    " Scheme"    2.48
    " SCHEME"    2.48
    " Schemes"   2.46
    " schemes"   2.44
    "Scheme"     2.39
    "schemes"    2.27
    "Schemes"    2.22
    "SCHEME"     2.07
    Activation Density: 0.103%

    No Known Activations