    Explanations

    transitions and additions

    Negative Logits
                   0.48
    "きっかけ"     0.42
    "ത്രം"          0.42
                   0.40
    "却是"         0.40
    " sauf"        0.39
    " وہی"         0.38
    "যদিও"         0.37
    " aunque"      0.36
    " хоть"        0.36
    Positive Logits
    " furthermore"     3.25
    " moreover"        3.14
    "Furthermore"      3.02
    " Furthermore"     2.97
    " inoltre"         2.88
    " Moreover"        2.86
    " additionally"    2.83
    " Additionally"    2.81
    "Moreover"         2.77
    "Additionally"     2.75
    Activation Density: 0.123%

    No Known Activations