INDEX
    Explanations

    organized points or instructions within the text

    New Auto-Interp
    Negative Logits
    提供了
    -0.52
     gek
    -0.50
     wears
    -0.50
     jonge
    -0.48
    här
    -0.47
     okaz
    -0.46
     historical
    -0.46
    DllImport
    -0.46
     Brem
    -0.46
     ordf
    -0.46
    POSITIVE LOGITS
     make
    0.86
     don
    0.78
     use
    0.76
     try
    0.76
     start
    0.72
     pick
    0.71
     take
    0.71
     للمعارف
    0.69
     uſe
    0.69
     ſind
    0.68
    Act Density 0.301%

    No Known Activations