INDEX
    Explanations

    phrases related to measurements and comparisons in various contexts

    New Auto-Interp
    Negative Logits
     queſta
    -1.02
     zwiſchen
    -0.82
     dieſes
    -0.82
     estekak
    -0.81
     propOrder
    -0.80
    enderror
    -0.78
    tagHelperRunner
    -0.77
    ſammen
    -0.76
    exitRule
    -0.76
    ſchen
    -0.75
    POSITIVE LOGITS
     the
    2.47
    the
    0.84
    The
    0.84
     The
    0.74
     את
    0.47
    0.36
    rethe
    0.35
     הה
    0.35
     teh
    0.34
    sthe
    0.33
    Act Density 15.975%

    No Known Activations