INDEX
Explanations
German words and phrases amidst English context
Negative Logits
SHIP        -0.44
MET         -0.41
CHR         -0.41
SC          -0.36
ministry    -0.35
MIT         -0.34
д           -0.34
SL          -0.34
DF          -0.33
strings     -0.33
Positive Logits
ained       0.52
asing       0.52
urch        0.51
andise      0.51
arted       0.50
arak        0.47
okemon      0.47
atche       0.46
eneg        0.46
aining      0.46
Activations Density: 6.536%