INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Seems
    0.55
     Seems
    0.55
     seeming
    0.52
    seems
    0.49
     seems
    0.49
    となります
    0.40
    似乎
    0.39
     parece
    0.39
     தெரியும்
    0.39
    ):=
    0.38
    POSITIVE LOGITS
     have
    0.89
     be
    0.82
     belong
    0.78
    have
    0.73
     haber
    0.71
     haberse
    0.70
     hayan
    0.63
     haben
    0.63
     want
    0.61
     haver
    0.61
    Act Density 0.027%

    No Known Activations