INDEX
    Explanations

    repeated or recurring phrases

    New Auto-Interp
    Negative Logits
    ãĤĪãģĨãģª
    -0.19
    ãģĬ
    -0.19
    人æ°Ĺ
    -0.17
    ä¸ĢåĪĩ
    -0.16
     Äijây
    -0.16
    åŃIJä¾Ľ
    -0.16
    umber
    -0.15
    大
    -0.15
    ä¸Ģ
    -0.15
    ListOf
    -0.15
    POSITIVE LOGITS
     sorts
    0.40
    course
    0.34
     course
    0.28
       
    0.26
    -course
    0.26
    vido
    0.25
    /from
    0.23
    /by
    0.22
    lox
    0.21
    ftime
    0.20
    Act Density 1.823%

    No Known Activations