    Explanations

    instances of the word "while" typically used to introduce contrasting or conditional phrases

    Negative Logits
    vat         -0.17
    ichen       -0.15
    ant         -0.15
    tra         -0.14
    äºİæĺ¯      -0.13
    ropy        -0.13
    Tabs        -0.13
    нед         -0.13
    acies       -0.13
    etc         -0.13

    Positive Logits
    s           0.19
    ruba        0.16
     initially  0.15
    mailto      0.15
    orget       0.15
    snap        0.14
    ough        0.14
    tty         0.14
    çł          0.13
    sink        0.13
    Act Density 0.034%

    No Known Activations