INDEX
    Explanations

    mentions of a specific brand or company name

    New Auto-Interp
    Negative Logits
     Herrick
    -0.74
     McLaugh
    -0.68
    <bos>
    -0.67
     Unger
    -0.66
     McFar
    -0.64
     Kearns
    -0.64
     McInt
    -0.64
     Kruse
    -0.57
     Hickey
    -0.57
     inform
    -0.57
    POSITIVE LOGITS
     Ottobre
    1.59
     Baldwin
    1.58
     broder
    1.54
     Settembre
    1.49
     cannes
    1.49
     marseille
    1.48
     Traité
    1.42
     Luglio
    1.41
     tyn
    1.41
     chèvre
    1.39
    Act Density 0.299%

    No Known Activations