INDEX
Explanations
references to historical figures and their contributions
Preceding titles of famous people
eminent figures
New Auto-Interp
Negative Logits
Aholisi
-0.50
kope
-0.50
adpleegd
-0.44
Handlung
-0.44
ướ
-0.43
disambiguazione
-0.42
όμε
-0.42
testens
-0.42
AssemblyCompany
-0.42
freshman
-0.41
POSITIVE LOGITS
legendary
1.05
eminent
0.96
legendary
0.95
influential
0.95
genius
0.93
greats
0.93
figures
0.92
great
0.91
famous
0.91
figure
0.90
Activations Density 0.390%