    Explanations

    asking questions and requiring things

    Negative Logits
    "ದಲ"               0.45
    " kinderg"         0.45
    "যায়"              0.43
    "াদেশিক"           0.42
    (blank)            0.40
    " tenders"         0.40
    " মার্চের"          0.40
    " illustrated"     0.40
    " நூற்றாண்டின்"     0.40
    " nativity"        0.40

    Positive Logits
    "SOR"              0.47
    "Sor"              0.43
    "Lor"              0.43
    "Expression"       0.41
    "Shir"             0.39
    "̡"                0.38
    " Sor"             0.38
    "Canon"            0.38
    "CQL"              0.38
    "Sorry"            0.37

    Activation Density: 0.000%

    No Known Activations