INDEX
    Explanations
    Head Attr Weights
    0:0.07
    1:0.07
    2:0.08
    3:0.09
    4:0.08
    5:0.08
    6:0.08
    7:0.06
    8:0.09
    9:0.08
    10:0.09
    11:0.07
    Negative Logits
    Token               Logit
    "blers"             -2.61
    "Labour"            -2.61
    "escape"            -2.59
    "soDeliveryDate"    -2.56
    " traveller"        -2.54
    " Flavoring"        -2.47
    " Regulations"      -2.44
    " NSW"              -2.43
    " Riders"           -2.43
    " realised"         -2.34
    Positive Logits
    Token               Logit
    " Bosnia"           2.87
    " Arist"            2.71
    " tyrann"           2.55
    " Hispan"           2.50
    " Serbia"           2.50
    " Kosovo"           2.49
    " olig"             2.48
    " dictators"        2.28
    " Serbian"          2.28
    " Marcos"           2.26
    Act Density 0.000%

    No Known Activations