INDEX
    Explanations

    names of luxury car brands and car-related terms

    New Auto-Interp
    Negative Logits
    Reviewer
    -0.46
     partName
    -0.44
     attribute
    -0.43
     hindsight
    -0.42
     outweigh
    -0.42
     prompt
    -0.42
     worthwhile
    -0.41
     convenient
    -0.40
     rewarding
    -0.40
     carefully
    -0.39
    POSITIVE LOGITS
     etc
    0.62
    Tel
    0.50
    etc
    0.49
     TBA
    0.48
    undai
    0.48
    areth
    0.46
    uania
    0.46
    omy
    0.45
    ãĤ±
    0.45
    aches
    0.44
    Act Density 0.382%

    No Known Activations