INDEX
    Explanations

    Book of Job

    New Auto-Interp
    Negative Logits
     inklud
    -0.08
     הכר
    -0.08
    =>{↵
    -0.08
     комб
    -0.08
     чтоб
    -0.08
    љ
    -0.08
    add
    -0.08
    ilar
    -0.08
     =>{↵
    -0.08
     урож
    -0.08
    POSITIVE LOGITS
     picky
    0.09
     instructional
    0.08
    inho
    0.07
    .fxml
    0.07
    Undo
    0.07
    ivel
    0.07
     frankly
    0.07
    INF
    0.07
    Instructions
    0.07
     nọ
    0.07
    Act Density 0.001%

    No Known Activations