INDEX
Explanations
proper nouns, particularly names of people and organizations
names after "both" or "respectively"
New Auto-Interp
Negative Logits
and
-0.65
性和
-0.63
力和
-0.59
Ecotoxicity
-0.54
子和
-0.49
noDo
-0.48
人和
-0.43
المناصب
-0.42
rungsseite
-0.41
كومونز
-0.40
POSITIVE LOGITS
respectively
0.68
begge
0.66
respectivamente
0.66
alike
0.62
ambos
0.59
keduanya
0.56
båda
0.56
Ambos
0.55
respectively
0.54
दोनों
0.53
Activations Density 0.127%