INDEX
    Explanations

    symbols and characters that are uncommon or not typically found in regular text

    special characters or symbols typically found in non-English text

    Negative Logits
    Token            Logit
    " itch"          -0.73
    " Gemini"        -0.72
    "raints"         -0.71
    "utterstock"     -0.67
    " viability"     -0.67
    "ukong"          -0.66
    " inertia"       -0.66
    "etsk"           -0.65
    " Clarks"        -0.65
    " tentacles"     -0.64
    Positive Logits
    Token            Logit
    "ña"             1.06
    "ñ"              1.01
    "ÃĽ"             0.99
    "ï¸ı"            0.91
    "lean"           0.91
    "ļ"              0.87
    "kay"            0.87
    "¹"              0.86
    "µ"              0.86
    "rug"            0.85
    Activation Density: 0.029%

    No Known Activations