INDEX
    Explanations

    role definition prompts

    Negative Logits
     article     0.73
    articolo     0.70
    aurais       0.69
    जब           0.68
     théor       0.68
     articles    0.67
     സമ്മ         0.67
     deinem      0.66
     theor       0.65
    juris        0.64
    Positive Logits
     Kontrolle   0.76
     Controle    0.75
     Control     0.74
     Radiat      0.73
     controle    0.72
     controla    0.72
    แข่งขัน        0.70
     facing      0.70
     trapped     0.70
    控制          0.70
    Act Density 0.024%

    No Known Activations