INDEX
Explanations
proper nouns or titles given to entities or objects
phrases that include the word "dubbed" or similar synonyms indicating a label or title
Negative Logits
ramid       -0.66
Admin       -0.64
Admin       -0.64
UI          -0.63
istg        -0.63
=-=-=-=-    -0.61
emic        -0.60
iment       -0.59
chedel      -0.58
=-=-        -0.58
Positive Logits
dubbed      0.75
fair        0.74
selves      0.74
"#          0.72
phas        0.71
geon        0.71
"@          0.70
é¾          0.68
comings     0.68
iously      0.68
Activations Density 0.022%