    Explanations

    exclamatory oh god/gosh

    Negative Logits
    Token             Logit
                      0.81
    "u"               0.69
    " in"             0.67
                      0.66
    " murder"         0.65
                      0.63
    "d"               0.63
    " inconsider"     0.63
    "s"               0.61
                      0.60
    Positive Logits
    Token             Logit
    "at"              0.81
    "ك"               0.71
    "for"             0.70
    "isem"            0.67
                      0.66
    "ва"              0.65
    "را"              0.65
    " for"            0.64
    "م"               0.63
    "र्निंग"            0.62
    Activation Density: 0.000%

    No Known Activations