INDEX
    Explanations

    phrases related to scientific proposals and mechanisms

    scientific and generalization terms

    New Auto-Interp
    Negative Logits
     natomiast
    -0.45
     lisäksi
    -0.34
     murni
    -0.34
     bowiem
    -0.32
     diejenigen
    -0.30
     Königin
    -0.29
     pimpinan
    -0.29
     semula
    -0.28
     directement
    -0.27
    jenigen
    -0.27
    POSITIVE LOGITS
    хьтан
    0.83
    ſicht
    0.82
    iſchen
    0.81
     ſeines
    0.81
     zwiſchen
    0.80
    ſelben
    0.79
    iſche
    0.79
     deſſen
    0.79
    NameInMap
    0.78
     daysTop
    0.78
    Act Density 0.035%

    No Known Activations