    Explanations

    numbers and calculations

    Negative Logits
     बुला
    0.43
    ন্যবাদ
    0.43
    thanks
    0.42
     உதவுக
    0.41
    Sharing
    0.39
     મદદ
    0.39
    0.39
    Wooden
    0.39
    udahkan
    0.39
    Watching
    0.38
    Positive Logits
     macron
    0.43
     shift
    0.42
     Pareto
    0.41
     skyrocketed
    0.40
     anthropogenic
    0.39
     change
    0.39
     lament
    0.38
     skyrocketing
    0.38
     plummet
    0.37
     wilde
    0.37
    Activation Density: 0.005%

    No Known Activations