INDEX
    Explanations

    phrases indicating time spent or frequency of activities

    Negative Logits
    undler       -0.16
    ICENSE       -0.15
    \grid        -0.14
     Dalton      -0.14
     Manor       -0.14
    casting      -0.14
    it           -0.13
    raci         -0.13
    anth         -0.13
    xfb          -0.13
    Positive Logits
    ãĥªãĥ¼ãĤº    0.14
    endi         0.14
    ovich        0.14
    roit         0.14
    agi          0.14
    çļĦåľ°       0.14
    gı           0.13
     ÃĤu         0.13
     firefight   0.13
    lace         0.13
    Act Density 0.040%

    No Known Activations