INDEX
    Explanations

    technical instructions or definitions within documents

    Introductions or clarifications, often with examples

    introducing examples or hypotheticals

    New Auto-Interp
    Negative Logits
     оригіналу
    -0.69
     تانيه
    -0.69
    iestety
    -0.67
     niestety
    -0.63
    vrigt
    -0.63
     كمان
    -0.62
     tevens
    -0.61
    =$?
    -0.61
     atleast
    -0.61
     valamint
    -0.61
    POSITIVE LOGITS
     say
    1.10
    say
    0.96
     misalnya
    0.95
     bijvoorbeeld
    0.90
     Suppose
    0.89
     například
    0.88
     suppose
    0.87
     beispielsweise
    0.87
    Suppose
    0.86
     مث
    0.83
    Act Density 0.795%

    No Known Activations