INDEX
Explanations
proper nouns or named entities with a focus on names and titles
specific names or proper nouns, particularly related to individuals and organizations
New Auto-Interp
Negative Logits
Reply
-0.80
commissions
-0.62
bearer
-0.61
reckoning
-0.60
autistic
-0.59
veland
-0.59
CPR
-0.57
cryst
-0.57
ply
-0.56
amber
-0.56
POSITIVE LOGITS
bilt
0.88
export
0.80
sov
0.79
rius
0.73
amins
0.73
ovich
0.72
Leaks
0.71
kov
0.71
iev
0.69
ennes
0.69
Activations Density 0.217%