INDEX
    Explanations

    relative pronouns introducing clauses

    New Auto-Interp
    Negative Logits
     disa
    0.64
    정을
    0.57
     rely
    0.56
     belum
    0.55
     परेशान
    0.54
     fallback
    0.53
     јед
    0.53
     Rely
    0.52
     परेशानी
    0.52
     Tidak
    0.51
    POSITIVE LOGITS
     them
    0.56
     подробно
    0.54
     then
    0.52
    ина
    0.51
    会将
    0.50
     scoops
    0.48
    告知
    0.47
     engraving
    0.47
     luscious
    0.46
    では
    0.45
    Act Density 0.111%

    No Known Activations