INDEX
    Explanations

    descriptions of vehicles, specifically mentioning colors and models

    mentions of specific car brands and models

    New Auto-Interp
    Negative Logits
    terness
    -0.78
    vironment
    -0.76
    hyde
    -0.72
    nown
    -0.72
    utenberg
    -0.71
    population
    -0.69
    Percent
    -0.69
    aghd
    -0.66
    ptroller
    -0.66
     thanking
    -0.65
    POSITIVE LOGITS
     MacBook
    1.38
     BMW
    1.24
     laptops
    1.24
     Jaguar
    1.22
     iPhones
    1.21
     iP
    1.18
     smartphones
    1.18
     convertible
    1.17
     laptop
    1.17
     automobiles
    1.16
    Act Density 0.803%

    No Known Activations