INDEX
    Explanations

    repeated phrases or patterns in text

    New Auto-Interp
    Negative Logits
    -0.67
     {{$
    -0.63
    {{$
    -0.60
    gül
    -0.59
     Bisch
    -0.58
    ად
    -0.57
     point
    -0.56
    @@@@@@@@
    -0.56
    labelledby
    -0.55
    ="{{$
    -0.55
    POSITIVE LOGITS
     ...
    2.96
     ....
    2.33
     …
    2.32
    !...
    2.13
    ...
    2.11
     ..."
    2.11
    (...
    2.09
     .....
    2.08
    ,...
    2.06
     ...)
    2.06
    Act Density 0.136%

    No Known Activations