INDEX
Explanations
phrases related to social criticism and satire
conjunctions and phrases that indicate relationships or connections between ideas
Negative Logits
MU          -0.92
<[          -0.82
Place       -0.80
Effect      -0.78
Both        -0.78
matter      -0.76
Register    -0.75
amount      -0.75
kind        -0.74
emen        -0.73
Positive Logits
bloated     1.13
flashy      1.13
occasional  1.11
assorted    1.11
endless     1.10
slick       1.09
quirky      1.07
relentless  1.05
soaring     1.05
scant       1.03
Activations Density 0.353%