INDEX
    Explanations

    the word "Carleton" with a very high activation level

    references to the name "Carleton."

    Negative Logits
    Token             Logit
    " Su"             -0.75
    " word"           -0.66
    "龍契士"          -0.59
    " ratings"        -0.58
    " controlled"     -0.57
    " compensated"    -0.57
    " worldwide"      -0.57
    " term"           -0.57
    " warrants"       -0.57
    " fre"            -0.56
    Positive Logits
    Token             Logit
    "leton"           4.55
    "letal"           1.76
    "erton"           1.21
    "legate"          1.19
    "lington"         1.16
    "alore"           1.08
    "let"             1.07
    "negie"           1.07
    "erella"          1.03
    "illac"           1.02
    Activation Density 0.020%

    No Known Activations