INDEX
    Explanations

    comparative phrases highlighting similarities or analogies

    Negative Logits
    Token            Logit
    " itſelf"        -1.04
    " raiſ"          -1.00
    "ſelf"           -0.94
    " pleaſure"      -0.93
    " Houſe"         -0.92
    " Anſ"           -0.90
    " Reſ"           -0.88
    " Conſ"          -0.88
    " houſe"         -0.87
    "venidos"        -0.86
    Positive Logits
    Token                  Logit
    " AS"                  1.21
    " As"                  1.19
    " as"                  1.12
    "As"                   1.05
    "readAs"               1.04
    "AS"                   0.85
    " CreateTagHelper"     0.82
    "ValueStyle"           0.81
    "as"                   0.75
    "follows"              0.74
    Activation Density: 0.360%

    No Known Activations