INDEX
Explanations
phrases or terms that indicate classification or characterization
New Auto-Interp
Negative Logits
policías
-0.52
peuple
-0.47
nestjs
-0.46
américa
-0.39
people
-0.39
politiet
-0.38
obé
-0.37
américains
-0.37
zákon
-0.36
centerY
-0.35
POSITIVE LOGITS
a
0.65
незавершена
0.64
AssemblyCompany
0.64
invokingState
0.61
视为
0.59
awtextra
0.59
extAlignment
0.59
nakalista
0.58
IsContent
0.57
an
0.56
Activations Density 0.507%