INDEX
Explanations
quotations within text
quoted statements or dialogue in the text
New Auto-Interp
Negative Logits
metic
-0.72
derby
-0.71
killer
-0.70
developmental
-0.68
rall
-0.66
reper
-0.66
nesting
-0.65
flared
-0.65
breakthrough
-0.64
cancell
-0.64
POSITIVE LOGITS
I
1.04
there
1.02
Jews
1.02
God
0.99
trust
0.98
straight
0.98
nothing
0.98
Where
0.97
double
0.96
Jewish
0.96
Activations Density 0.066%