INDEX
Explanations
references to duality or pairs in various contexts
New Auto-Interp
Negative Logits
ต่างๆ
-0.56
Various
-0.51
vários
-0.49
semua
-0.48
jenigen
-0.48
Various
-0.48
antaranya
-0.48
pelbagai
-0.47
allemaal
-0.47
berbagai
-0.47
POSITIVE LOGITS
sides
1.44
sexes
1.25
sides
1.14
parties
1.04
halves
1.01
genders
1.00
ends
0.98
Sides
0.90
Sides
0.88
kinds
0.86
Activations Density 0.220%