INDEX
    Explanations

    the beginning of a new segment in the text

    New Auto-Interp
    Negative Logits
     -------
    -0.73
    hibernate
    -0.73
    ʹ
    -0.70
    ンダー
    -0.70
    hwnd
    -0.69
    Eugen
    -0.68
     Miche
    -0.68
    ;"></
    -0.68
    idste
    -0.68
    <<<<<<<<
    -0.67
    POSITIVE LOGITS
    1.02
    ...
    0.86
    […]
    0.86
     متعلقه
    0.84
     Kowalski
    0.82
     macrophages
    0.81
    makeConstraints
    0.79
    onnaissance
    0.78
     Lazarus
    0.78
     Telex
    0.78
    Act Density 0.211%

    No Known Activations