INDEX
    Explanations

    statements that express uncertainty or declare something unexpected

    New Auto-Interp
    Negative Logits
     enfans
    -0.57
     Italij
    -0.55
     Königin
    -0.50
     ſtand
    -0.49
     siinä
    -0.48
     récents
    -0.48
    jenigen
    -0.48
     käytt
    -0.47
     Flasche
    -0.47
     esprits
    -0.46
    POSITIVE LOGITS
     للمعارف
    0.51
     ['./
    0.51
    ########.
    0.46
     useAppContext
    0.45
     Bruno
    0.44
     announce
    0.43
     happening
    0.43
     embarrassing
    0.40
     THIS
    0.40
     Gai
    0.40
    Act Density 0.041%

    No Known Activations