INDEX
Explanations
phrases related to measurements and comparisons in various contexts
New Auto-Interp
Negative Logits
queſta
-1.02
zwiſchen
-0.82
dieſes
-0.82
estekak
-0.81
propOrder
-0.80
enderror
-0.78
tagHelperRunner
-0.77
ſammen
-0.76
exitRule
-0.76
ſchen
-0.75
POSITIVE LOGITS
the
2.47
the
0.84
The
0.84
The
0.74
את
0.47
的
0.36
rethe
0.35
הה
0.35
teh
0.34
sthe
0.33
Activations Density 15.975%