INDEX
    Explanations

    the most, oldest, best, earliest, least

    Negative Logits
    Token               Logit
    "Everything"        0.91
    " دوسرا"            0.88
    " entsprechende"    0.87
    " ryzy"             0.86
    "Needless"          0.84
    " optimale"         0.83
    "Everyone"          0.82
    " zoals"            0.82
    " oczywiście"       0.81
    "Another"           0.80
    Positive Logits
    Token               Logit
    " few"              1.81
    " top"              1.20
    "few"               1.20
    " wenigen"          1.18
    " oldest"           1.17
    " hallmarks"        1.16
    "几个"              1.15
    " earliest"         1.13
    " reasons"          1.12
    " Few"              1.11
    Activation Density: 0.131%

    No Known Activations