    Explanations

    frequency and temporal conditions

    Negative Logits
    "્સ"  3.49
    "ted"  2.91
    "ل"  2.90
    "tive"  2.89
    " aument"  2.72
    "ेबल"  2.60
    "χρι"  2.44
    "sj"  2.43
    " einiger"  2.43
    "ую"  2.38
    Positive Logits
    "%%%%%%%%%%%%%%%%"  3.32
    "%%%%"  3.05
    "e"  2.85
    "%%%%%%%%%%%%"  2.84
    "ce"  2.79
    "ся"  2.77
    [token not rendered]  2.72
    "%%%%%%%%"  2.71
    "zijde"  2.69
    "i"  2.65
    Act Density 0.058%

    No Known Activations