INDEX
    Explanations

    numerical data and statistics related to study results

    Negative Logits
    Token (quoted)      Logit
    " Efq"              -1.06
    " Monfieur"         -1.03
    " raiſ"             -0.98
    " itſelf"           -0.95
    " Theſe"            -0.95
    " Houſe"            -0.94
    "CloseOperation"    -0.94
    " myſelf"           -0.93
    " ſeveral"          -0.92
    "]='\"              -0.92
    Positive Logits
    Token (quoted)      Logit
    "."                 0.60
    (blank)             0.55
    ","                 0.55
    "i"                 0.49
    " Chwiliwch"        0.48
    "x"                 0.44
    "os"                0.44
    "?"                 0.44
    "!"                 0.42
    " –"                0.42
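
The page does not say how the logit values above are computed. A common approach for feature dashboards of this kind, assumed here, is to take the dot product of the feature's output direction with each token's unembedding vector and list the most negative and most positive results. The sketch below illustrates that idea with toy numbers; `logit_effects`, `top_k`, and all vectors are hypothetical and are not taken from the page.

```python
# Minimal sketch (assumption): per-token logit effect = dot(feature direction, token unembedding).
# All names and numbers are illustrative toy values, not data from the dashboard.

def logit_effects(direction, unembedding, vocab):
    """Return {token: dot(direction, unembedding vector for that token)}."""
    effects = {}
    for token, vector in zip(vocab, unembedding):
        effects[token] = sum(d * w for d, w in zip(direction, vector))
    return effects


def top_k(effects, k):
    """Return the k most negative and k most positive (token, value) pairs."""
    ranked = sorted(effects.items(), key=lambda pair: pair[1])
    negative = ranked[:k]          # smallest values first
    positive = ranked[::-1][:k]    # largest values first
    return negative, positive


# Toy vocabulary with one 2-dimensional unembedding vector per token.
vocab = [".", ",", " Efq", " Houſe"]
unembedding = [[1.0, 0.0], [0.5, 0.5], [-1.0, 0.0], [-0.5, -0.5]]
direction = [1.0, 0.0]  # hypothetical feature output direction

effects = logit_effects(direction, unembedding, vocab)
negative, positive = top_k(effects, k=2)
print("negative:", negative)
print("positive:", positive)
```

Sorting once and reading both ends of the ranking keeps the negative and positive lists consistent with each other.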
    Activation Density: 0.011%
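
Activation density is usually read as the fraction of sampled tokens on which the feature fires. The sketch below shows that calculation under the assumption that "fires" means an activation strictly above zero; the function name `activation_density`, the threshold, and the sample counts are hypothetical and chosen only so the toy result matches the percentage format shown here.

```python
# Minimal sketch (assumption): density = tokens with activation > threshold / total tokens.

def activation_density(activations, threshold=0.0):
    """Fraction of activations strictly above the threshold."""
    if not activations:
        return 0.0
    active = sum(1 for value in activations if value > threshold)
    return active / len(activations)


# Toy sample: 11 active tokens out of 100,000 (illustrative counts only).
activations = [0.0] * 99_989 + [0.7] * 11
density = activation_density(activations)
print(f"{density:.3%}")
```

A density this low means the feature is silent on almost every token, which is consistent with a dashboard that lists no known activations.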

    No Known Activations