INDEX
Explanations
occurrences and references to the number two or three in contexts related to entities or groups
New Auto-Interp
Negative Logits
Several
-0.14
-valu
-0.14
ophil
-0.14
åŃĺäºİ
-0.14
several
-0.14
veral
-0.13
../../
-0.13
ena
-0.13
vers
-0.13
olog
-0.12
POSITIVE LOGITS
teenth
0.26
remaining
0.26
aforementioned
0.25
remaining
0.23
most
0.22
latest
0.21
-legged
0.21
newest
0.21
amigos
0.21
most
0.20
Activations Density 0.117%