INDEX
Explanations
mentions of the word "lower" in various contexts
references to low or lower levels of measurement or quality
New Auto-Interp
Negative Logits
Pros
-0.68
andise
-0.66
aint
-0.66
³³³³³³³³
-0.65
Jew
-0.65
arya
-0.62
YES
-0.61
wip
-0.60
Ze
-0.60
ONSORED
-0.60
POSITIVE LOGITS
case
0.99
extrem
0.88
than
0.88
iating
0.86
inhib
0.85
iation
0.80
down
0.78
jaw
0.76
iated
0.75
levels
0.75
Activations Density 0.044%