INDEX
Explanations
phrases related to instructions or directives
terms related to organizational structures and roles
New Auto-Interp
Negative Logits
entimes
-0.66
uristic
-0.63
sometimes
-0.62
ozy
-0.61
proverb
-0.61
slaught
-0.60
hua
-0.59
whel
-0.59
eka
-0.56
erick
-0.56
POSITIVE LOGITS
except
1.67
imaginable
1.27
except
1.21
alike
1.00
whatsoever
0.90
irrespective
0.88
including
0.86
together
0.85
sexes
0.85
Including
0.84
Activations Density 0.304%