INDEX
    Explanations

    repeated use of the verb "was."

    New Auto-Interp
    Negative Logits
    urg
    -1.73
    RS
    -1.72
    nd
    -1.58
    LT
    -1.57
    GM
    -1.52
    EV
    -1.50
    G
    -1.50
    м
    -1.48
    mean
    -1.47
    DD
    -1.45
    POSITIVE LOGITS
    Ļª
    3.84
    3.49
    3.48
    3.48
    3.48
    č↵      
    3.48
    ↵↵         
    3.48
    3.48
    ↵↵              
    3.48
    3.48
    Act Density 0.332%

    No Known Activations