INDEX
    Explanations

    sections of code or program-related content

    public method declarations

    New Auto-Interp
    Negative Logits
     defaultstate
    -0.68
    RenderAtEndOf
    -0.68
    MessageTagHelper
    -0.68
     queſta
    -0.60
     ſind
    -0.60
     المعيارى
    -0.59
    KommentareTeilen
    -0.58
     ſte
    -0.57
     stiefe
    -0.55
     indígen
    -0.55
    POSITIVE LOGITS
    ↵↵↵↵↵
    0.56
    ↵↵↵↵
    0.55
    ↵↵↵
    0.55
    ↵↵↵↵↵↵↵
    0.51
    ↵↵↵↵↵↵↵↵↵↵↵
    0.49
    ↵↵↵↵↵↵
    0.48
    ↵↵↵↵↵↵↵↵
    0.45
    ↵↵↵↵↵↵↵↵↵
    0.44
    ......
    0.43
    .*;
    0.43
    Act Density 0.003%

    No Known Activations