INDEX
Explanations
phrases indicating causation or conditions of existence
New Auto-Interp
Negative Logits
<<<<<<<<<<<<<<
-0.69
̈́
-0.69
كويكب
-0.66
שוליים
-0.65
RotationOrder
-0.65
+#+#
-0.60
SharedCtor
-0.60
Datuak
-0.60
ंदीखरीदारी
-0.59
findpost
-0.58
POSITIVE LOGITS
esser
0.32
étant
0.32
being
0.31
JsonResponse
0.31
estar
0.30
事儿
0.30
PATENT
0.29
patent
0.29
ότι
0.29
escort
0.29
Activations Density 0.087%