INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
    0.74
    0.70
     minify
    0.70
    0.69
    0.69
     mostrar
    0.68
     perone
    0.66
     joten
    0.66
     pediu
    0.66
     kiu
    0.66
    POSITIVE LOGITS
    ق
    0.76
    лно
    0.62
    zelfde
    0.57
    kiej
    0.57
    ר
    0.56
    лни
    0.55
    ح
    0.53
    ו
    0.52
    ע
    0.52
    zione
    0.52
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.