INDEX
    Explanations

    mentions of tolls or negative impacts in various contexts

    mentions of "toll" in various contexts, particularly relating to costs and impacts

    Negative Logits
    Token            Logit
    "itals"          -0.82
    " //["           -0.81
    " Correspond"    -0.73
    "ital"           -0.68
    "ITY"            -0.67
    "ansion"         -0.66
    " UNIVERS"       -0.65
    "furt"           -0.65
    " Marriage"      -0.63
    "Craft"          -0.61
    Positive Logits
    Token            Logit
    " toll"          0.99
    "s"              0.93
    " booths"        0.88
    "atures"         0.81
    "sie"            0.77
    " Toll"          0.76
    "sg"             0.75
    "thirst"         0.74
    " booth"         0.74
    " plaza"         0.73
    Activation Density: 0.018%

    No Known Activations