INDEX
    Explanations

    occurrences of the word "first."

    New Auto-Interp
    Negative Logits
    aul
    -0.17
     periods
    -0.14
    -age
    -0.14
     Epoch
    -0.14
     Period
    -0.14
    erge
    -0.14
    essen
    -0.14
     sm
    -0.14
    ara
    -0.13
    ider
    -0.13
    POSITIVE LOGITS
     time
    0.29
    	time
    0.20
    .time
    0.20
     vez
    0.20
    time
    0.20
     keer
    0.19
    æĹ¶éĹ´
    0.18
     fois
    0.17
     TIME
    0.17
     ÏĨοÏģ
    0.17
    Act Density 0.012%

    No Known Activations