INDEX
Explanations
phrases or comparisons indicating superiority or similarity in comparison to something else
phrases that compare two entities or concepts
New Auto-Interp
Negative Logits
Ô
-0.64
ende
-0.64
agre
-0.61
livest
-0.61
reperto
-0.61
accordingly
-0.61
pestic
-0.60
ertodd
-0.59
remem
-0.59
phabet
-0.59
POSITIVE LOGITS
course
1.09
icial
0.84
sorts
0.81
course
0.76
ours
0.75
ramer
0.75
tains
0.71
rame
0.64
ordinary
0.63
mund
0.61
Activations Density 0.083%