INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     elves
    -0.07
     hosted
    -0.07
     registered
    -0.07
    明白
    -0.07
    .ver
    -0.07
    _hostname
    -0.07
     consulate
    -0.06
     jokes
    -0.06
    Versions
    -0.06
    كون
    -0.06
    POSITIVE LOGITS
    breadcrumb
    0.11
    Breadcrumb
    0.10
    breadcrumbs
    0.09
     breadcrumb
    0.09
    readcrumb
    0.09
     breadcrumbs
    0.09
    readcrumbs
    0.07
    car
    0.07
     vener
    0.06
    0.06
    Act Density 0.001%

    No Known Activations