INDEX
Explanations
proper nouns related to characters or entities in a narrative
New Auto-Interp
Negative Logits
chu
-0.75
originate
-0.72
Interested
-0.70
Aware
-0.69
eem
-0.67
riz
-0.66
ر
-0.66
starter
-0.64
ND
-0.63
uese
-0.63
POSITIVE LOGITS
neglected
0.72
deserted
0.71
itness
0.71
abandoned
0.67
ffield
0.66
ignored
0.65
drowned
0.64
forgotten
0.64
langu
0.63
fle
0.63
Activations Density 0.300%