INDEX
Explanations
subordinate clauses beginning with "that"
New Auto-Interp
Negative Logits
backer
-0.95
Detailed
-0.75
Guard
-0.72
stal
-0.71
anie
-0.68
horn
-0.68
cellaneous
-0.67
rawdownloadcloneembedreportprint
-0.66
Details
-0.66
details
-0.66
POSITIVE LOGITS
someday
0.89
thood
0.86
homosexuality
0.84
humans
0.77
rationality
0.76
immutable
0.75
mankind
0.74
there
0.74
sexuality
0.73
morality
0.72
Activations Density 0.078%