INDEX
Explanations
negative descriptions of various concepts or situations
references to the science fiction genre
New Auto-Interp
Negative Logits
Duchess
-0.73
Clerk
-0.65
Holt
-0.64
Clintons
-0.64
Roose
-0.62
Downing
-0.62
ADRA
-0.62
blush
-0.62
Rost
-0.62
Chains
-0.62
POSITIVE LOGITS
fi
1.26
sci
1.14
Fi
1.00
fiction
0.98
tech
0.92
ê
0.89
fi
0.85
inspired
0.84
technical
0.83
engineering
0.81
Activations Density 0.022%