INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    oshi
    -0.51
    Côte
    -0.51
    hye
    -0.49
     propOrder
    -0.47
    implicit
    -0.47
    lime
    -0.46
    late
    -0.44
    CharCode
    -0.44
    Dmit
    -0.44
    Nok
    -0.44
    POSITIVE LOGITS
     museum
    1.94
     Museum
    1.84
    Museum
    1.74
     museums
    1.69
     MUSEUM
    1.68
    museum
    1.62
     Museums
    1.48
    Museums
    1.36
     museo
    1.30
     museu
    1.27
    Act Density 0.002%

    No Known Activations