INDEX
    Explanations

    phrases related to being taken by surprise or experiencing unexpected events

    phrases that express emotional reactions or sentiments

    Negative Logits
     Slate       -0.85
     Whedon      -0.84
     BuzzFeed    -0.78
     Isles       -0.77
     Holt        -0.76
     Moff        -0.76
     Hyde        -0.74
     Scott       -0.73
     Doyle       -0.72
     Huff        -0.72
    Positive Logits
     learnt      1.48
     realised    1.12
     envis       1.05
     till        1.03
    ´            1.02
     realise     1.02
     analysed    1.00
    alore        0.99
     organise    0.99
    jriwal       0.98
    Activation Density: 0.817%

    No Known Activations