INDEX
Explanations
phrases indicating a requirement or directive for a group of people
references to groups or categories of individuals and entities
Negative Logits
edia      -0.76
itta      -0.68
tch       -0.64
instein   -0.63
ahime     -0.63
agger     -0.62
odcast    -0.61
epad      -0.60
Braun     -0.59
proverb   -0.59
Positive Logits
except        1.54
except        1.15
imaginable    1.08
whatsoever    1.06
soever        1.04
alike         1.01
irrespective  0.95
facets        0.85
regardless    0.83
sexes         0.80
Activations Density: 0.258%