INDEX
    Explanations

    not about perfection or franticness

    New Auto-Interp
    Negative Logits
     basic
    0.93
     simple
    0.88
     genuinely
    0.84
    basic
    0.81
     truly
    0.79
     pure
    0.79
     true
    0.78
     simplement
    0.78
     genuine
    0.78
     simply
    0.78
    POSITIVE LOGITS
    ellten
    0.92
    0.89
    olução
    0.87
     एमएन
    0.86
    стояние
    0.85
     Conform
    0.84
    0.84
     прави
    0.84
    特別な
    0.83
     принадле
    0.83
    Act Density 0.105%

    No Known Activations