    Explanations

    phrases indicating the state or condition of something, particularly emphasizing the word "as."

    Negative Logits
    igrated
    -0.15
    そこ
    -0.14
    ạnh
    -0.14
     Listed
    -0.14
    urtles
    -0.14
    含
    -0.13
    asurer
    -0.13
     součástí
    -0.13
    ushima
    -0.13
    ses
    -0.13
    Positive Logits
     indeed
    0.25
     happens
    0.22
     happened
    0.22
     is
    0.22
     was
    0.21
     proven
    0.21
     seen
    0.21
     att
    0.20
     can
    0.20
     are
    0.20
    Act Density 0.074%

    No Known Activations