INDEX
Explanations
the name "Benedict" at varying strengths
references to notable individuals, particularly those in positions of power or influence
New Auto-Interp
Negative Logits
WAY
-0.90
WAYS
-0.90
ULT
-0.89
PER
-0.84
MIC
-0.82
BLE
-0.78
DAQ
-0.74
matically
-0.74
Ko
-0.74
matic
-0.74
POSITIVE LOGITS
Cumber
1.05
shire
0.97
itude
0.90
Schwarzenegger
0.85
XVI
0.85
éĹĺ
0.81
ured
0.80
ieth
0.79
sson
0.78
rophic
0.75
Activations Density 0.032%