INDEX
    Explanations

    references to specific political figures and events related to Brexit negotiations

    New Auto-Interp
    Head Attr Weights
    0:0.21
    1:0.03
    2:0.12
    3:0.10
    4:0.04
    5:0.03
    6:0.02
    7:0.01
    8:0.14
    9:0.07
    10:0.04
    11:0.13
    Negative Logits
    rep
    -1.56
     660
    -1.49
     760
    -1.47
     (<
    -1.41
    awaru
    -1.39
    eton
    -1.37
     VIS
    -1.36
     spons
    -1.35
    Tenn
    -1.34
    rouse
    -1.33
    POSITIVE LOGITS
     magnets
    1.57
     molecules
    1.50
    TAG
    1.49
    esters
    1.47
    yang
    1.47
     neurons
    1.45
    ドラゴン
    1.45
     algorithms
    1.43
    1.41
    -|
    1.39
    Act Density 0.000%

    No Known Activations