INDEX
Explanations
statements or opinions embedded within commas or parentheses
phrases that discuss motives and actions related to groups or individuals in societal contexts
New Auto-Interp
Negative Logits
soType
-0.71
conom
-0.65
politics
-0.64
aminer
-0.64
road
-0.62
ascript
-0.62
Stats
-0.59
omics
-0.59
interrupted
-0.58
URL
-0.58
POSITIVE LOGITS
them
1.11
they
1.08
these
1.08
they
1.06
them
0.96
Their
0.95
their
0.92
They
0.92
their
0.91
these
0.90
Activations Density 0.690%