    Explanations

    instances of the word "example."

    Negative Logits
    (token missing in source)    -0.72
    " Uint"                      -0.67
    "ỏi"                         -0.67
    "日语"                       -0.65
    "日在"                       -0.65
    " altas"                     -0.63
    "وردار"                      -0.63
    " BufferedReader"            -0.62
    "колеп"                      -0.61
    "บัติ"                       -0.61
    Positive Logits
    " examples"    1.97
    " example"     1.82
    "examples"     1.80
    " EXAMPLE"     1.71
    "example"      1.70
    "Example"      1.67
    " Example"     1.65
    " Examples"    1.63
    "Examples"     1.56
    "EXAMPLE"      1.55
    Act Density 0.074%

    No Known Activations