INDEX
Explanations
terms indicating comparisons or relationships between items in a sequential or ordered context
New Auto-Interp
Negative Logits
Dene
-0.62
nothing
-0.59
GW
-0.59
Phrase
-0.58
Huck
-0.58
ாத
-0.57
'>
-0.57
IOL
-0.57
FL
-0.57
GW
-0.56
POSITIVE LOGITS
respectively
1.04
respectively
0.99
respectivamente
0.84
pective
0.84
masing
0.78
respective
0.77
respectivement
0.76
تضيفلها
0.75
Gruber
0.74
EndContext
0.74
Activations Density 0.113%