INDEX
    Explanations

    quick prompts, questions, announcements, reflexes

    New Auto-Interp
    Negative Logits
    "\
    1.95
    ا
    1.91
    ка
    1.90
    er
    1.86
     längst
    1.80
    ر
    1.80
     stadig
    1.76
    Mostrar
    1.76
    Benzyl
    1.73
     inzwischen
    1.72
    POSITIVE LOGITS
    !--
    1.90
     adultery
    1.82
    aneous
    1.82
    aneously
    1.78
     glance
    1.77
     birdseye
    1.73
    aneity
    1.71
     dox
    1.65
    یکی
    1.64
    encji
    1.62
    Act Density 0.284%

    No Known Activations