Explanations
phrases indicating agreement or similarity
instances of the word "alike."
Negative Logits
ohyd      -0.68
someone   -0.68
Georg     -0.68
](        -0.65
OLOG      -0.65
Penal     -0.63
Ô         -0.63
Ö¼        -0.63
Emb       -0.63
olid      -0.62
Positive Logits
alike        1.23
lihood       1.12
sexes        0.89
minded       0.82
soever       0.79
nodd         0.74
!--          0.73
sheets       0.71
minded       0.69
fascinated   0.69
Activations Density: 0.009%
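To put the density figure in concrete terms, here is a minimal sketch. It assumes that "density" means the fraction of sampled token positions on which the feature's activation is above zero; the page itself does not define the term. The function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and only serve the illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: density = (number of active positions) / (total positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 9 of 100,000 token positions.
acts = [0.0] * 100_000
for i in range(9):
    acts[i * 1000] = 1.5  # arbitrary nonzero activation value

density = activation_density(acts)
print(f"{density:.3%}")  # formatted as a percentage with three decimals
```

Under that reading, a density of 0.009% corresponds to roughly 9 active positions per 100,000 sampled tokens, which makes this a very sparse feature.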