INDEX
Explanations
phrases related to standard or typical operations or conditions
terms related to conventional norms and standards
New Auto-Interp
Negative Logits
psey
-0.76
ificantly
-0.73
amia
-0.72
untarily
-0.70
orously
-0.68
initely
-0.65
asca
-0.64
zilla
-0.63
leased
-0.63
antically
-0.62
POSITIVE LOGITS
fare
0.95
greeting
0.89
decency
0.82
procedure
0.74
Occupations
0.72
tropes
0.70
ItemImage
0.68
iquette
0.67
hello
0.66
ensical
0.66
Activations Density 0.474%