Explanations
comparative phrases that indicate a preference or contrast between entities
Negative Logits
refiere         -0.55
astify          -0.51
shadowOpacity   -0.50
sendok          -0.49
있어            -0.49
featureID       -0.47
tepat           -0.47
하십시오        -0.45
HORE            -0.44
pères           -0.44
Positive Logits
themſelves      0.81
myſelf          0.80
itſelf          0.80
poffible        0.79
Jefus           0.78
himſelf         0.78
ſelves          0.77
whoſe           0.77
leſs            0.76
becauſe         0.75
Activations Density 0.459%
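
For readers unfamiliar with the density figure, the sketch below shows one way such a number could be computed. It assumes that density means the fraction of sampled tokens on which the feature's activation is above zero; the dashboard does not state its exact definition, so treat that as an assumption. The function name `activation_density` and the toy data are hypothetical and are not taken from the tool.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" is the share of sampled tokens on which the
    feature fires at all. The dashboard's exact definition is not given.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy sample: 1000 tokens, 5 of which activate the feature.
toy_activations = [0.0] * 995 + [1.2, 0.3, 2.5, 0.7, 0.1]
density = activation_density(toy_activations)
print(f"Activations Density {density:.3%}")
```

Under this reading, a density of 0.459% would mean the feature is active on roughly 46 of every 10,000 sampled tokens.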