INDEX
Explanations
references to famous books, fictional characters, and political figures
Negative Logits
Token       Logit
*/(         -0.83
eros        -0.82
uably       -0.81
iasco       -0.76
Helpful     -0.71
fficient    -0.70
ebin        -0.70
USD         -0.68
arbon       -0.67
ctrl        -0.67
Positive Logits
Token       Logit
Admir       0.77
cliffe      0.76
Sovereign   0.74
Rothschild  0.74
Prayer      0.73
Duchess     0.68
Card        0.67
Majesty     0.67
assies      0.67
Lann        0.66
Activations Density: 16.550%