INDEX
    Explanations

    quantitative measures and comparisons

    New Auto-Interp
    Negative Logits
    etty
    -0.15
    heel
    -0.14
    edelta
    -0.14
    ilon
    -0.14
     fewer
    -0.14
     Few
    -0.13
    HEEL
    -0.13
     Feinstein
    -0.13
    uali
    -0.13
    _HS
    -0.13
    POSITIVE LOGITS
     fold
    0.48
     times
    0.47
    -fold
    0.45
    times
    0.44
     TIMES
    0.43
    åĢį
    0.42
    fold
    0.41
    -times
    0.40
     folds
    0.40
     Fold
    0.39
    Act Density 0.080%

    No Known Activations