Explanations
names or references to individuals
names of individuals, particularly prominent figures and their connections
Negative Logits
ista         -0.75
oats         -0.72
emonium      -0.71
istas        -0.71
Leaks        -0.69
CTV          -0.69
notations    -0.68
ing          -0.67
ingo         -0.66
llular       -0.62
Positive Logits
bard         0.83
rosis        0.82
etr          0.80
chel         0.78
iat          0.77
proble       0.76
rich         0.75
itability    0.71
rod          0.70
nil          0.69
Activations Density: 0.052%