INDEX
    Explanations

    references to locations, universities, and organizations

    names of locations and institutions, particularly in relation to France and Berkeley

    New Auto-Interp
    Negative Logits
    Äĩ
    -0.78
    sterdam
    -0.73
    iland
    -0.71
    aceae
    -0.71
    abad
    -0.69
    lio
    -0.68
    atem
    -0.67
    omore
    -0.65
    wake
    -0.64
    conn
    -0.63
    POSITIVE LOGITS
     Cable
    0.71
    Arcade
    0.62
    ãĥ¼ãĥĨ
    0.61
     Levant
    0.61
     Falcon
    0.61
    çİĭ
    0.59
     Ultron
    0.59
    ij士
    0.58
     Asgard
    0.58
     Viper
    0.57
    Act Density 0.428%

    No Known Activations