INDEX
    Explanations

    information about fictional characters and their roles in narratives.

    New Auto-Interp
    Negative Logits
     postavlj
    0.31
    விலான
    0.30
    جار
    0.27
    ரன்
    0.27
    ByUser
    0.27
    FuncName
    0.27
     olmayan
    0.27
    0.27
    0.27
     بالش
    0.27
    POSITIVE LOGITS
     is
    0.59
    ?
    0.57
     it
    0.53
     you
    0.51
     they
    0.47
    ؟
    0.46
    0.45
    soever
    0.45
    ?"
    0.44
    you
    0.43
    Act Density 0.552%

    No Known Activations