INDEX
    Explanations

    names of researchers and scientists

    proper nouns, particularly names of researchers and their affiliations

    New Auto-Interp
    Negative Logits
     Hurricane
    -0.76
     mileage
    -0.75
     living
    -0.72
     prime
    -0.71
     continual
    -0.71
     Teddy
    -0.71
     totality
    -0.70
     creatively
    -0.70
     billboards
    -0.70
     successive
    -0.69
    POSITIVE LOGITS
    inav
    1.32
    essler
    1.31
    ohl
    1.27
    ijn
    1.26
    itsch
    1.25
    ijk
    1.22
    abet
    1.21
    atz
    1.21
    ör
    1.20
    idth
    1.20
    Act Density 0.163%

    No Known Activations