INDEX
Explanations
structured lists and descriptions of collective entities or concepts
Negative Logits
both    -0.26
Both    -0.22
Both    -0.22
both    -0.22
BOTH    -0.20
beide   -0.20
third   -0.17
tring   -0.17
_both   -0.16
両      -0.16
Positive Logits
four    0.56
five    0.52
five    0.46
six     0.45
seven   0.43
four    0.41
bốn     0.41
eight   0.40
четы    0.39
cuatro  0.38
Activations Density 0.184%