Explanations
proper nouns, likely related to names of people or places
names or titles of individuals
Negative Logits
Token         Logit
âĶĢâĶĢ        -0.75
equivalent    -0.72
AAA           -0.71
Genesis       -0.66
EDITION       -0.66
crate         -0.66
lounge        -0.65
Thumbnails    -0.65
Highlander    -0.65
Cerberus      -0.64
Positive Logits
Token         Logit
inski         1.27
acci          1.19
insky         1.19
auer          1.17
ansky         1.16
enson         1.15
kowski        1.13
anski         1.11
zel           1.10
iani          1.09
Activations Density: 0.417%