INDEX
Explanations
comparative phrases indicating contrasts or differences between subjects
New Auto-Interp
Negative Logits
ungkinkan
-0.68
bezeichneter
-0.63
erweise
-0.61
jména
-0.61
://"
-0.60
ranean
-0.58
'
-0.57
يئة
-0.57
邦
-0.56
}'
-0.55
POSITIVE LOGITS
vs
1.85
versus
1.66
Vs
1.54
Versus
1.50
versus
1.42
vs
1.35
Versus
1.34
VS
1.33
Against
1.33
Against
1.32
Activations Density 0.138%