INDEX
    Explanations

    expressions emphasizing the term "so" in various contexts

    New Auto-Interp
    Negative Logits
    pane
    -0.16
    nty
    -0.15
    eous
    -0.15
    more
    -0.15
    apolis
    -0.15
    моÑĢ
    -0.15
    phan
    -0.14
    bies
    -0.14
    gram
    -0.14
    incy
    -0.14
    POSITIVE LOGITS
    -called
    0.42
     much
    0.36
    oo
    0.33
    apy
    0.33
    oooo
    0.32
    iled
    0.32
    ooo
    0.30
    oth
    0.29
    ars
    0.28
    aks
    0.28
    Act Density 0.055%

    No Known Activations