INDEX
Explanations
names of individuals
proper nouns, particularly names and titles
New Auto-Interp
Negative Logits
zsche
-0.71
ĸļ
-0.68
minecraft
-0.63
assetsadobe
-0.62
sear
-0.60
govtrack
-0.60
breakthrough
-0.60
sap
-0.60
kefeller
-0.59
Dhabi
-0.58
POSITIVE LOGITS
acter
0.81
auga
0.80
Cola
0.72
abbage
0.71
ãĤ¶
0.68
atro
0.68
onential
0.65
oglu
0.65
inctions
0.63
Clay
0.61
Activations Density 0.294%