INDEX
    Explanations

    evidence of decision-making or processes involving improvement and verification

    Beginning of foreign language words

    New Auto-Interp
    Negative Logits
     Monfieur
    -0.57
    blers
    -0.55
    esian
    -0.55
    middot
    -0.54
    ieso
    -0.53
     Bikin
    -0.53
    
    -0.53
     grown
    -0.52
    verkehr
    -0.52
    geord
    -0.52
    POSITIVE LOGITS
     went
    0.96
     came
    0.92
     took
    0.92
     was
    0.91
     did
    0.88
     gave
    0.86
     flew
    0.80
     has
    0.79
    came
    0.76
     began
    0.75
    Act Density 0.120%

    No Known Activations