INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    """
    -0.48
    ].
    -0.44
    );
    -0.42
    -0.42
    ).
    -0.41
    ">
    -0.41
    ];
    -0.40
    .
    -0.40
    <h4>
    -0.39
    */
    -0.39
    POSITIVE LOGITS
     colspan
    1.18
     rowspan
    1.00
    0.94
     المعيارى
    0.90
    iſchen
    0.88
    ſicht
    0.88
    niſſe
    0.84
    ſchaft
    0.84
     MainAxisSize
    0.82
    ロウィン
    0.82
    Act Density 0.091%

    No Known Activations