INDEX
    Explanations

    names of institutions, people, and notable events or phenomena

    New Auto-Interp
    Negative Logits
    icka
    -0.74
    ¥µ
    -0.72
    pherd
    -0.71
    olulu
    -0.70
    ĪĴ
    -0.70
    olves
    -0.69
     renters
    -0.68
    eous
    -0.68
    iatus
    -0.68
    eric
    -0.66
    POSITIVE LOGITS
     Turing
    0.94
     Pharmaceutical
    0.86
     Britann
    0.75
    fer
    0.68
    Box
    0.67
     Gingrich
    0.67
     Clause
    0.67
    breaker
    0.66
     Machines
    0.66
     Test
    0.66
    Act Density 0.005%

    No Known Activations