INDEX
Explanations
mentions of influential figures, particularly popes and historical figures
New Auto-Interp
Negative Logits
ital
-0.17
worthy
-0.15
consult
-0.15
mal
-0.14
antan
-0.14
canonical
-0.14
ukan
-0.14
olland
-0.14
ural
-0.14
isma
-0.14
POSITIVE LOGITS
asin
0.15
ast
0.14
orca
0.14
ecies
0.14
(CultureInfo
0.14
ilin
0.14
æĸĻ
0.14
ugen
0.14
uj
0.14
sembly
0.14
Activations Density 0.011%