Explanations
phrases related to scientific proposals and mechanisms
scientific and generalization terms
Negative Logits
natomiast     -0.45
lisäksi       -0.34
murni         -0.34
bowiem        -0.32
diejenigen    -0.30
Königin       -0.29
pimpinan      -0.29
semula        -0.28
directement   -0.27
jenigen       -0.27
Positive Logits
хьтан         0.83
ſicht         0.82
iſchen        0.81
ſeines        0.81
zwiſchen      0.80
ſelben        0.79
iſche         0.79
deſſen        0.79
NameInMap     0.78
daysTop       0.78
Activations Density 0.035%