INDEX
Explanations
phrases indicating contradictions or clarifications
New Auto-Interp
Negative Logits
openg
-0.40
legale
-0.40
FIS
-0.38
OCCURRED
-0.38
ต่อ
-0.38
nościo
-0.38
ainda
-0.37
AsUp
-0.37
usul
-0.36
nabla
-0.36
POSITIVE LOGITS
oredCriteria
0.81
oprot
0.75
Попис
0.71
estekak
0.71
Kanpo
0.69
propOrder
0.65
ukunft
0.64
ProtoMessage
0.63
NSCoder
0.63
(
0.62
Activations Density 0.337%