INDEX
Explanations
mentions of the name "Stein" and its variations
Negative Logits
es       -0.23
a        -0.19
eat      -0.17
esz      -0.16
ters     -0.16
quina    -0.15
ent      -0.15
ome      -0.15
olor     -0.15
pie      -0.15
Positive Logits
hardt         0.21
forcements    0.19
rdf           0.19
rough         0.17
bach          0.17
forcement     0.16
kea           0.16
berg          0.15
ldr           0.15
ENDOR         0.14
Activations Density: 0.008%