INDEX
    Explanations

    phrases associated with causation and processes

    New Auto-Interp
    Negative Logits
    ImageContext
    -0.82
    <bos>
    -0.66
     nonUne
    -0.64
    Portail
    -0.61
    はじめに
    -0.60
    InputBorder
    -0.59
    Rüyada
    -0.59
    ijų
    -0.57
    posedge
    -0.54
    彿
    -0.54
    POSITIVE LOGITS
     purpoſe
    0.66
     myſelf
    0.64
     itſelf
    0.61
     Jefus
    0.61
     raiſ
    0.61
    ſelf
    0.60
    hematical
    0.59
     himſelf
    0.58
     paſſ
    0.57
     Efq
    0.57
    Act Density 0.739%

    No Known Activations