INDEX
    Explanations

    stumbled upon, found, encountered

    New Auto-Interp
    Negative Logits
    otard
    0.77
     cuantit
    0.72
     كافة
    0.72
    quantitative
    0.69
     quantitative
    0.67
     strives
    0.67
     prévoir
    0.65
     जाइए
    0.64
    selfish
    0.63
    муля
    0.61
    POSITIVE LOGITS
     discovered
    2.55
     stumbled
    2.40
     발견
    2.37
    发现
    2.25
    发现了
    2.25
     discovery
    2.23
     encountered
    2.20
    發現
    2.18
     discovering
    2.10
    discovered
    1.99
    Act Density 0.171%

    No Known Activations