INDEX
Explanations
references to disparities or differences in various contexts, particularly focusing on gaps
New Auto-Interp
Negative Logits
zas
-0.16
bourg
-0.15
entifier
-0.15
hti
-0.14
aukee
-0.14
deg
-0.14
ÃŃsticas
-0.14
ÏĦÏģο
-0.14
lobe
-0.14
ESC
-0.13
POSITIVE LOGITS
between
0.21
gap
0.20
междÑĥ
0.19
/loose
0.19
between
0.18
-gap
0.18
gap
0.17
Between
0.17
Between
0.17
zwischen
0.17
Activations Density 0.042%