INDEX
    Explanations

    instances of the character 'v' or variants of it in different contexts

    Negative Logits
     raiſ
    -1.12
     ſche
    -0.97
     myſelf
    -0.97
    ſelf
    -0.95
     Majefty
    -0.91
     purpoſe
    -0.91
    ſelves
    -0.90
     iſt
    -0.89
     Theſe
    -0.89
     ་་
    -0.88
    Positive Logits
     v
    2.15
    v
    1.82
     vv
    1.09
    vv
    0.98
    0.97
    Fv
    0.94
     w
    0.91
     vlog
    0.88
     u
    0.85
    vB
    0.82
    Activation Density 0.142%

    No Known Activations