INDEX
Explanations
phrases related to specific names and entities
proper nouns, particularly names of individuals and entities
New Auto-Interp
Negative Logits
croft
-0.74
creen
-0.74
ships
-0.71
rising
-0.71
gow
-0.68
yy
-0.66
eer
-0.66
visors
-0.65
metics
-0.64
arity
-0.63
POSITIVE LOGITS
Thib
1.07
Babe
0.82
oland
0.77
âĵĺ
0.74
Canaver
0.74
izont
0.73
Spur
0.70
odka
0.70
pering
0.70
Dug
0.69
Activations Density 0.023%