    Explanations

    terms related to modes and references to planets

    Negative Logits
     Efq            -1.80
     itſelf         -1.69
     myſelf         -1.67
     Monfieur       -1.62
     Jefus          -1.61
     pleaſure       -1.55
     Theſe          -1.54
     Reſ            -1.51
     ſeveral        -1.47
     themſelves     -1.45
    Positive Logits
    mode            1.27
    MODE            1.07
     mode           1.02
    <eos>           0.95
                    0.91
    ↵↵              0.85
     I              0.83
    Mode            0.82
     ש              0.82
                    0.81
    Act Density 0.129%

    No Known Activations