Explanations
mentions of specific people or entities in a comparative context
phrases that indicate references or comparisons involving "the likes of" someone or something
Negative Logits
rang        -0.71
é¾įå¥ij士    -0.70
teasp       -0.69
ħĭ          -0.67
Ô           -0.67
proport     -0.67
INAL        -0.67
conduc      -0.66
IAL         -0.65
ĸļ          -0.64
Positive Logits
ours        0.80
Rodriguez   0.70
those       0.68
Phillips    0.67
Neal        0.66
Rut         0.66
Julius      0.66
Henry       0.66
Franklin    0.65
Machina     0.65
Activations Density 0.060%