INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    HEIM
    -0.83
    Źródło
    -0.79
    inaire
    -0.73
    Tell
    -0.71
    ğun
    -0.71
     рассказыва
    -0.70
     września
    -0.69
    SourceChecksum
    -0.69
    κά
    -0.69
    cises
    -0.68
    POSITIVE LOGITS
     reads
    5.00
     read
    4.13
    reads
    3.45
     Reads
    3.19
    Reads
    2.94
    read
    2.80
     reading
    2.77
    2.38
     READ
    2.25
     readings
    2.14
    Act Density 0.046%

    No Known Activations