INDEX
Explanations
negative statements and expressions of doubt regarding opinions, parties, and entities
New Auto-Interp
Negative Logits
-0.78
featureID
-0.75
RenderAtEndOf
-0.72
AssemblyCulture
-0.70
IndentedString
-0.69
ujednoznacz
-0.68
propOrder
-0.68
ligiloj
-0.68
beginnetje
-0.67
IsMutable
-0.67
POSITIVE LOGITS
none
0.67
nenhum
0.58
nadie
0.58
ninguno
0.58
none
0.57
nobody
0.57
ninguém
0.56
None
0.55
ninguna
0.54
żad
0.54
Activations Density 0.262%